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This is the first version of SYSTAT programmed completely on 
the Macintosh. Mark Bjerknes has completed a project of several 
years’ duration on one of the largest Mac applications to date. Greg 
Mullins who joined SYSTAT after the project began, helped with 
numerous new features. Tom McHale worked with Greg to develop 


the Spin program. 


This is also the first major revision of the SYSTAT and SYGRAPH 
documentation. MaryAnn Hill, who joined us this year as Director 
of Technical Services, is primarily responsible for this reorganiza- 
tion. I think it is the most usable manual we have produced. 


The documentation could not have been completed without Erin 
Vang and Stacey Miceli. Erin wrote much of the material in Getting 
Started and produced page layouts. Stacey developed art work and 
contributed to the SYGRAPH manual. 


Chris Soderquist, Pat Fleury, Mike Pechnyo, Amie Wilkinson, and 
others helped with quality control. Eve Goldman, Kns Yasurake, 
Stephanie Shishko, Lucy Saunders, and Tracy Koe supervised mar- 
keting and producton. 


Despite the technical difficulties of producing software of this 
magnitude for a machine like the Mac, we are all excited by the di- 
rection it has taken us. I want to thank all our Mac users and beta 
testers for suggestions which improved this version so enormously. I 
like our new look. 


Leland Wilkinson 
SYSTAT, Inc. 
Evanston, IL 
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The first section of this chapter tells how to install SYSTAT and 
SYGRAPH on your Macintosh. Please take the time to look over our 
instructions even if you have installed SYSTAT products before. 


The second section covers some technical details; you can probably just 
scan the section for now. 


The third section, “Overview of documentation” gives an overview of 
this manual and the other three volumes, DATA, SYGRAPH, and 
SYSTAT. We discuss the organization, notation, and objectives of the 
manuals. 


The final section, “Macintosh basics” is a brief survival guide to the 
Macintosh that might be useful if you have never used a Macintosh 
before or if our simple, step-by-step instructions don’t seem so simple 
or step-by-step. 








Installing SYSTAT and SYGRAPH 
eee 


New features If you have used previous versions of SYSTAT or SYGRAPH for the 
Macintosh, you will no doubt find this version very exciting. 


Fully menu-driven operation 
We now have a fui/ implementation of the Macintosh point-and-click 
interface. You can do ail of your work through menus and dialog boxes. 
SYSTAT and SYGRAPH have special menus with icons to remind you 
what all the different types of analyses and graphics look like. Both these 
menus can be torm off to sit ready anywhere on your desktop. 


Dialog boxes remember your choices 
All dialog boxes remember your choices from one procedure to the next, 
so if you want to—say—recompute a regression model without one 
variable, all you need to do is leave everything the way it was but remove 
one variable. If you want, you can save all of your dialog box settings 
from one session to the next. 


Optional command interface 
If you are an experienced SYSTAT or SYGRAPH user, you will be 
happy to know that you can still use commands if you want. We provide 
an optional command interface that even mouse-addicts will appreciate: 
you can use it to keep an audit trail of your work, to do repetitive analy- 
ses in batches, or to save time doing complex operations. 





One integrated environment , 
All of SYSTAT’s procedures and SYGRAPH are now one program. (No 
more switching modules.) SYSTAT and SYGRAPH share a Data 
Editor and data files. They also share utilities for manipulating and 
transforming data. Both programs are integrated to share one menu bar 
and to work simultaneously and interactively with the same data files and 
the same windows. Despite this integration, Macintosh SYSTAT is still 
fully compatible with SYSTAT on any other platform—minicomputer, 
MS-DOS versions, etc. 
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Links between analyses and graphics 
In addition to all of SYSTAT and SYGRAPH being integrated, we have 
added HyperCard-like links between the Data Editor and the graph 
output window: scatterplot brushing tools let you see the relationships be- 
tween your data and your graph visually, instantly. You can select and 
extract data visually, instantly. You can follow one case in the Editor 
through many plots in a SPLOM visually, instantly. 


3-D plot spinning 
You can spin three-dimensional scatterplots in real time. Animated 3-D 
plots could save you hours of computation and redrawing plots from 
different perspective. SYGRAPH even tells you what [Eye] setting to 
use to reproduce a perspective with a final, presentation quality graphic. 


Instant help 
You can get general help from a special Help window available through 
the Apple menu. You can get instant, context-sensitive help for any 
menu item or any dialog box or any. button or any window—at a click of 
the mouse. You can get a quick review for any term or statistic in the 
output from an analysis. 


Full color 
If your machine supports color, you can draw graphs in full color. 


Drawing tools 
MacDraw-like drawing and writing tools let you customize your graphs 
in full color. 


installing — To install SYSTAT and SYGRAPH, insert Disk #1 into the floppy disk 
drive. Double-click the disk to see its contents. Then, double-click the 
Install application and follow the instructions it gives. 


Viruses Our products are extremely sensitive to viruses. One possible scenario: 
you install everything, you start SYSTAT up and everything goes fine, 
and you Quit. The next time—or a few times later—that you go to run 
SYSTAT, it won’t start. You probably have a virus. 


© 1989, SYSTAT, Inc. 3 





Installation 





Installing SYSTAT and SYGRAPH 


Here are some ways to check whether you have a virus: 


e Use virus-detecting programs such as Vaccine or Ferret; these detect 
many common viruses; contact your Macintosh dealer for help 

e Check whether the file size of the SYSTAT application has increased 
since you installed (use File/Get Info from the Finder’s menu) 

© See whether the modification date in the Get Info screen has 
changed to a date when you ran SYSTAT 


Our release disks do not contain viruses. All of our development and pro- 
duction machines are isolated physically and we take numerous precau- 
tions to ensure that they cannot become infected. 


If you think your machine has a virus, use ResEdit or Interferon to in- 
vestigate and eliminate viruses. Your Macintosh dealer can help you. 
Contact our Technical Support department if you have further ques- 
tions. 
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Each procedure (including DATA) dynamically allocates memory for 
variables. The only way to discover the limits of the procedure is to try a 
large problem. In many procedures, the limits are on vanables, not on 
cases. Almost every routine in SYSTAT can process as many Cases as 
you can fit on your disk. The exceptions are some nonparametric and 
rime series routines which store all the data in internal memory. The 
Data Editor can process the maximum number of variables allowed in a 
SYSTAT file. Not all the statistical routines can handle this many 
variables, however, because they use additional memory to process the 
analyses. 


Processing time varies with machine. The input and output routines are 
slower than the statistical routines because SYSTAT must look for char- 
acter variables and missing data and must convert numbers to characters 
before printing them. While this slows printing, it ensures that 

SYSTAT produces exactly the same results on any computer. In addi- 
tion, the BASIC language provided by DATA is slow, since it interprets 
commands when they are executed. Again, the trade-off is that SYSTAT 
BASIC produces identical results on different machines. For most appli- 
cations, DATA is slower than the statistical rounnes. 


Disk space required varies with routines. Many routines use scratch files 
on the disk. Sorting, in particular, uses several extra files. The scratch 
files are stored in double precision, meaning at least 8 bytes for each 
number and 12 bytes for each character value. See Appendix I of the 
DATA manual for information on the file structure of SYSTAT files. 


In addition to your data, SYSTAT files store variable names, the version 
of SYSTAT that created the file, and the file type. All versions of 
SYSTAT can read files created by previous versions of SYSTAT for the 
same operating system. The reverse is not always true. SYSTAT fle 
types include rectangular (the usual), SSCP, covariance, correlanon, 
dissimilarity and similarity. Not all SYSTAT procedures can process al] 


types of files. 
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Technical notes 


SYSTAT stores files in binary form so that it can process them much 
faster than it could raw data files. This means that you cannot directly 
transfer SYST AT files between machines with different architectures 
and operating systems. You can transfer data files between machines 
through plain text (ASCID files or with MACPC. 


Note: Appendix IT in the DATA manual, SYSTAT file Structure, de- 
scribes the internal format of SYSTAT files and provides FORTRAN 
subroutines to process them. 


Supplementary modules available for SYSTAT and SYGRAPH provide 
highly specific data analysis or data management tools that are not part 
of the basic SYSTAT package. Current SYSTAT supplements include 
DESIGN (power analysis and expected mean squares), EZPATH (causal 
modeling), LOGIT (multinomial logistic regression), PROBIT, 
TOBIT, 2SLS (two-stage least squares), TESTAT (test item analysis, 
Rasch estimates), SETCOR (set correlation analysis), SIGNAL (signal 
detection analysis), and SURVIVAL (survival, reliability, and life tables 
analysis). Not all supplementary modules are available for every 
implementation of SYSTAT. — 
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These manuals are intended to serve both as reference guides—with 
concise, step-by-step instructions for using menus and dialog boxes— 
and as statistics and graphics handbooks. We hope that you will take the 
time to read each chapter carefully when you can, and we hope that you 
will be able to find information efficiently when all you have time to do 
is look up a procedure and go. 


We have divided the documentation into four volumes: Getting Started, 
DATA, SYGRAPH, and SYSTAT. 


This first book, Getting Started, is for just that: getting started with the 
basic program. You will learn how to use the SYSTAT Data Editor, 
how to handle files, how to save and print stuff, how to use the menus 
and dialog boxes, and how to do your statistical analyses and graphics. 


Getting Started shows you how to use everything in the File, Edit, Data, 
Goodies, and Editor menus. 


The next chapter is a Tutorial that will give you the basic feel of the 
program. We also present a data handling tutorial in the Data Editor 
chapter, a scatterplot brushing tutorial in the Exploratory data analysis 
chapter. And throughout this and the other manuals, we show you lots 
of examples that you can try—complete with data files, step-by-step in- 
structions, and output. 


This is the fun book. We show you how to spin plots. We show you 
how to use scatterplot brushing tools to link numbers to pictures. We 
show you how to get SYSTAT to do your work for you with instant 
graphs and statistics. 


The last chapter, Tips and shortcuts, gives some hints for really flying 
through your work. 


The DATA manual discusses the powerful DATA programming lan- 
guage that you can use for complex data manipulations. Most users will 
never need to open the DATA book, because you can accomplish so 
much with the SYSTAT Data Editor and the Data and Editor menus. 





Installation 


SYGRAPH 


SYSTAT 





Overview of documentation 


Your best bet is to set DATA on the shelf for a few weeks. After you are 
comfortable using SYSTAT and SYGRAPH and the Data Editor, flip 
through DATA and decide for yourself whether you need to tap 
DATA’s resources, 


The SYGRAPH manual shows you how to use SYGRAPH: in short, all 
the plots, graphs, and charts in the Graph menu. You should read 
Getting Started first, though—that’s where we show you how to handle 
the menu and dialog box system, how to size, position, and overlay 
plots, how to show Movies of your plots, how to spin plots, and how to 
use scatterplot brushing tools. 


The SYGRAPH manual has a separate chapter for each item of the 
Graph menu, and the chapters are organized according to graph type. 
Within each chapter, we show you first the major variations (the menu 
subitems) and then show how you can use the options to produce differ- 


ent types of graphs. 


The SYGRAPH manual is more than Just a software document telling 
what to click when. It’s also a graphics handbook, containing much in- 
formation about using graphs effectively. Take the time to read the 
manual. 


The SYSTAT manual shows you how to use SYSTAT: all the proce- 
dures in the Stats menu. Read Getting Started first to get the feel for the 
program and learn how to print, save, and handle data. 


The SYSTAT manual has a separate chapter for each item of the Stats 
menu, and the chapters are organized alphabetically. Within each 
chapter, we have four sections: an introduction to the procedures, 
“Usage” instructions, technical details about “Computations,” and 
“Examples” that you can try. 


The SYSTAT manual, too, is more than a software document. It is a 
statistics handbook, with introductions for every class of procedures, 
advice, and examples. You will learn to use SYSTAT—and statistics— 
wisely by taking the time to study the SYSTAT book. 
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Command references 
The next three volumes, DATA, SYGRAPH, and SYSTAT, each have 
command references as their first appendix. These appendices give in- 
structions and detailed syntax information for using the optional com- 
mand interface. 


Indices 
Each volume has a comprehensive index of topics, menu items, dialog 


boxes, options, functions, etc. 


Notation Each chapter in every manual begins with an overview of the chapter, 
and a table of contents for the chapter appears on the facing page. 


Headers 
At the top of every page, we show the name of the section and the name 
of the chapter. Chapter titles always are shown in green in the outside 
comer. 


Examples 
In all the manuals we have numbered the examples within each chapter: 
3.1, 3.2, 3.3, etc., where 3.2 is the second example in Chapter 3. For 
SYSTAT and SYGRAPH, we present an appendix of Command equiva- 
lents for examples that shows how to reproduce the examples using the 
optional command interface; the examples are organized by chapter and 
marked with their numbers.in the chapters. 


roy oer ey err cy. Pt ie 


Instructions 
When we show instructions for using menus and dialog boxes, we use 
bulleted lists of actions. Actions that are strictly required have solid bul- 
lets (¢), and optional actions have open bullets (¢). 


e You must do this 
© You might not need to do this 


We show typed input (for the optional command interface) in a type- 
writer font: 


THIS IS A COMMAND 
All mouse instructions and command input appear in green, to make 


them easier to find. 
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Menu items 
The names of menus and menu items always appear in boldface, e.g. the 


Corr item from the Stats menu. 


Some items in the Stats and Graph menus have subitems. For instance, 
Bar in the Graph menu has four subitems: Bar, Percent, Range, and 
Stacked. We sometimes abbreviate by saying “select Bar/Range from 
the Graph menu.” 


We also sometimes abbreviate regular menu items, e.g. 
Data/Formats... means “the Formats... item from the Data menu.” 


Options 
We enclose the names of options—buttons, checkboxes, icons, etc.—in 
square brackets. For instance, you click the [Text] button in the Save 
as... dialog box if you want to save your data in a plain text file. | 
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If you have ever used a Macintosh before, you probably don’t need to 
read this section. Most Macintosh software uses a consistent system of 
menus, windows, icons, and mouse techniques. The following does not 
tell all there is to know about the Macintosh interface, but it reviews the 
basic ideas of Macintosh usage. If any of the following paragraphs leave 
you confused, consult the Macintosh System Software User's Guide that 


came with your machine. 


The top strip of the SYSTAT screen is a menu bar that contains menus 
of things you can do. Each menu contains items that accomplish certain 


tasks. 


menu Dar 
Analysis (text) window View (graph) window 


(“& Pile Edit Data Graph Stats Goodies Editor ——_// 
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Editor window 
Command window 





Macintosh software works with a system of windows. Many things can be 
going on at once, and you can switch around between very different 
types of tasks, because each task happens in its own window. 
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Macintosh basics 


In SYSTAT, data are entered, manipulated, and viewed mn the Data 
Editor window. Statistical analysis results are displayed in the Analysis 
window. Graphs are drawn in the View window. You can spin plots in 
the 3-D Window. If you choose to use the optional command interface, 
you type commands in the Command window. 


At any given moment, one window is active and the others are dormant 
behind the current window. You can only work with the active window. 
To make a window active, click on it. 


You can resize, relocate, and close windows. To resize them, click the 
zoom box in the upper right corner or drag the size box in the lower right 
comer. (I'o drag something, point the arrow at the jtem. Press and hold 
the mouse button, move the arrow to a new location, and release the 
mouse button.) 


file name 
close box title bar Zoom box 


SYSTAT Data Editor 


» 


weadran es 





To move a window, drag its title bar at the top. To close a window, click 
the go away box or the close box in the upper left corner. 


When a window contains more material than can fit in the window, the 
window scrolls—earlier information rolls off the top or off the left to 
make room for more. To scroll a window one line or column at a time, 

click the arrows at the extreme ends of the scroll bars. To scroll one 

page (windowful) at a time,click in the shaded paging area of the scroll 

bars at the left and bottom edges of the window. To make larger leaps, 

drag the thumb (the white box) to a new position. 
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To use a menu item, pull down the menu and release the mouse button 
on the item you want. Some menu have Command-keystroke abbrevia- 
tions listed to their right on the menu; you can access these items by 
holding the Command (38) or Apple key and typing the letter. For ex- 
ample, Quit is abbreviated 36-Q. 


If you ever select an item from a menus, get a dialog box, and then 
decide you don’t want to do the procedure, just click Cancel. You can 


then try something else. 


You can use the mouse or cursor arrow keys to move around in the Data 
Editor and in the text windows. Press any arrow key once to move one 
row or column in that direction, and hold it down for repeated move- 
ments. 


You can type numbers from both the numeric keypad and the top row 
of the standard keyboard. 


—s 
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Overview Welcome to SYSTAT! Although SYSTAT has more features than some 
popular mainframe statistical packages, you can begin using it right 
away. [he chapters in this section introduce you to SYSTAT and the 
features, definitions and operations that are common to all modules in 
the SYSTAT system. 


Before you begin, you should already know how to set up SYSTAT on 
your computer. If you have not yet done so, please read the installacon 
and operating-system specific instructions in the previous chapter. 
There you will find how to make backup copies of the program, install 
SYSTAT on a hard disk drnve, and launch SYSTAT. 


This tutorial takes you through a sample SYSTAT session. We use a file 
that has already been prepared. (The Data Editor chapter shows you 
step-by-step how to create your own SYSTAT data files.) 


eres 
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Open a file Let’s start with some statistical analyses. First, we open a file. 


e Select Open... from the File menu 


| @MTCGM Edit Data Goodies 





Import... 

{lose 

Save 

Save as... 

Save selected tent... 
Page setup... 


PAE Preuie ts... 
Print selected teat... 
Print qraph... 
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Submit commend file... 
Submit clipboard FF 


e Select the Data Files directory (double-click Data Files, or click Data 
Files and click OK) 











[es 4.0sysTAT] 4.0SYSTAT 
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© Subroutines | 
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e Select the file USDATA 
e Click Open 
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Now, you should have a full menu bar. 


& File Edit Data Graph Stats Goodies 


You are now ready to do statistical and graphical analyses. Since you 
should always look at your data before doing analyses, we are going to 
star with some graphics. First, we need to see the Graph menu. 


The Graph and Stats menus are both teav-off menus. You can tear them 
off the menu bar and park them anywhere on your desktop. Your mouse 
saves on mileage, because you don’t need to keep going up to the top of 
the screen and pulling through the menu to select an item—you just 
press, slide, andrelease. ~ 


Graphics For now, let’s just tear off the Graph menu. Press and hold the mouse 
| button on Graph in the menu bar. The Graph menu drops down. Keep 
holding the mouse button. 
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é File Edit Data Biigiim Stats Goodies 
Ls?| | S 
3-D Plot] Bar Box 
=| bool | 
Gg) 
Icons Pie 
ne 7 a 

Plot Prob'y | Quantile 
EEE, 

hal 


Now, slide the pointer through the menu and off one of its three sides. 
(Don’t release the button yet!) Notice the marquee that trails your 
pointer. 


é¢ File Edit Date Bdge Stats Goodies 


sa a = 
pagent 

~~ e/ ? — 
3-0 Plot Box 






















i 
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Place this marquee where you want to park the menu and then— 
finally!—release the mouse button. 
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€ File Edit Data Graph Stats Goodies 
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© 
loons Pie 












ee ee 
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Voua! Your very own torn-off Graph menu! Note that the menu looks a 
lot like a window. Sure enough, that white box in the upper left corner 
is a close box. The black strip is a title bar. You can close and move the 
menu just like you’d expect. 
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Scatterplot 
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To use the menu, press and hold the mouse button on any of the items, 
e.g. Plot. 











r hal > y 
Categoryf Density | Function 
ogiC7) @ 

© 

icons Pie 
ca Evade 
Flt = a 

au = a 
Fhe Border | Contour { Fourier 
SPLO se 

t “1 Ey 

=| EX 
High-low | Influence] Parallel | Voronoi 








A submenu pops up offering variations of that type of plot, e.g. Plot, 
Border, Contour, Fourier, High-low, Influence, Parallel, and 
Voronoi. Slide the pointer to the item you want (Plot) and release the 


mouse button. 


¥Y variables H variable 


STATES STATES 
REGIONS _| | REGIONS 
POPDEN 4 | POPDEN 
PERSON i] | PERSON 
PROPERTY =| | PROPERTY 
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Tutorial 


Let’s use this dialog box. We want a scatterplot of CARDIO and 
PULMONAR, two of the variables in the USDATA file. 


@ Select the Y (vertical axis) variable: scroll through the [Y variables] 
selection list until you find CARDIO; click on it and then click 
[Select]; or, double-click CARDIO 

e Select the X variable: scroll through the [X variables} selection list to 
find PULMONAR, and either click it and click [Select] or double- 
click it 

e Click OK 


After a brief pause, a View window will appear on your desktop with the 
graph: 





This plots the reported incidence of cardiologically related deaths versus 
pulmonary deaths for the 50 U.S. states in 1982. Each square represents 
a single state. Numbers within the plot indicate the number of states 
that have similar values for these two variables and plot at the same 
point. | 


The Plot/Plot dialog box you just used has many options in the lower 
right corner. Notice, however, that all you bave to do is select Y and X 
variables. 
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Statistics 


Descriptive 
Statistics 
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Now let’s try a SYSTAT procedure. First, tear off the Stats menu the 
same way you tore off the Graph menu. (You don’t have to tear off the 
menus; we just think they’re easier to use when they are sitting ready.) 








e —, 


ge 


av?| pia? 


LS ner 
= Series 
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Ready? Okay. Let’s select Statistics... from the Stats item in the lower 
left corner of the Stats menu. (Stats is the only word that is both a 
menu and a menu item and—almost—a menu subitem; nothing else will 
be confusing this way. We promise!) 







» 


| rama 
Baal One-way ANOUR.... 


| = Statistics... 


Fur: 


a +. iv 4 
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Tutorial 


The Statistics... dialog box looks like this: 


(7) STATES 


REGIONS 
REGION 
| DIVISIONS 
DIVISION 


JP rege ro qe 


O cy Bd sp 

(1 Kurtosis ()SEM 
Maximum [] Skewness 
&] Mean [] Sum 

BJ Minimum [XJ Variance 
KJ] N 

C] Range Cj All 

CL] Save statistic 


merger: - +9 





Let’s get some descriptive statistics on five of the USDATA variables. 
Notice that some of the statistic options are already checked: 
[Maximum], [Mean], [Minimum], [N], [SD], and [Variance]. These 
statistics are the defaults: the things you get automatically if you don’t 
request something different. 


e Select the variables: ACCIDENT, CARDIO, CANCER, PULMONAR, 
PNEU_FLU 
e Click OK 


ee a F 
se, a a 


s 


a> Pechgge tinea, hes 
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Regression 
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Total Observations: 50 


ACCIDENT 
N of Cases 50 
Minimum 28.600 
Maximum 85.800 
Mean 44.312 
Standard Dev 10,983 
OLABETES 
N of Cases 50 
Minimum 3.400 
Maximum 26.200 
Mean 14.836 
Standard Dev 3.668 


CARDIO 


i}4 


84 


LIVER 


50 


. 600 
508. 
398. 


700 
530 


244 


50 


. 200 
.700 
.548 
745 


CANCER 


50 
76.100 
244.000 
178.410 
33.051 


PULMONAR PNEU_FLU 


50 50 
8.300 12.400 
36.300 32.400 
26.486 21.038 
5.602 4.140 


These are the default statistics. If you want other statistics or only a few 
of these, check the ones you want and uncheck any that you don’t want. 


Some graphic and statistical procedures have more subitems and options 
than others. You need not worry, however, about learning all the details 
before using a procedure. You can do almost any procedure simply, 
saving tricky options and variations for when you have more time and 


more experience. 


Let’s illustrate this point by using the most complicated menu item in 
the whole package: MGLH. 


e Select MGLH/Regression... from the Stats menu 


STATES 

| REGIONS 
REGION 
DIVISIONS 
DIVISION 


Independent 
CONSTANT 


[] Save residuals 
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Now let’s do a simple regression of CARDIO on PULMONAR. You 

may notice that our data are not ideally suited for this model because the 

variances in CARDIO are not constant at all levels of PULMONAR 

and because there are dependencies across observations due to similar 

death rates in adjacent states. SYSTAT can be used to analyze more 
complex regression models suitable for data of this kind. Nevertheless, 
to keep this example simple, do the following: 


e Select CARDIO from the [Dependent] variables selection list 


® Click {Indep’dent] and select the independent variable and select 
PULMONAR 
| e Click OK 


| Notice that SYSTAT automatically included a CONSTANT term in 

t the [Indep’dent] selection list. If you ever want to estimate a model 
without the CONSTANT (which we don’t recommend), just select it 
with the mouse and press the Delete key. 


Here is the output: 


Dep Var: CARDIO N: 50 Multiple R: .441 Squared Multiple R: .194 
iia Adjusted Squared Multiple R: .178 Standard Error of Estimate: 76,397 
Variable Coefficient Std Error Std Coef Tolerance is P(2 Tail) 

CONSTANT 222.932 52.716 0.0600 1.0000000 4.229 0.000 

PULMONAR 6.630 1.948 0.441 1.0000000 3.403 0.01 


Analysis of Variance 
Source Sum-of-Squares OF “Mean-Square F-Ratio Pp 


Regression 67598 .890 L 67598 .890 11.582 0.001 
Residual 280153.975 48 §836.541 


22M Peg Do 


Mita 
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Help 3 


Overview Help for SYSTAT and SYGRAPH is never more than a click away. You 
can get on-screen help for any type of analysis or graphic, any menu 
3 | item or subitem, any dialog box, any button, any check box, or any se- 
lection list. See the “Help” section. 





If you choose to use the command interface, you can get help for the 
commands, See the “Command help” section. 


Pe em 2 ieee arr > 


Further, you can get information about any statistical term or title in the 
Analysis window output. See the “Information” section. 


“agin "yp ingles: , 4 7 


41 “aR Bre Sa 
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Help 





You can get help for anything in SYSTAT and SYGRAPH. 


General help: From the Apple menu, you can get general help for any 
type of analysis or graphing procedure along with any of the utilities for 
data selection, weighting, analysis by groups, and so on. See “General 


help,” below. 


Dialog help: You can get help for any dialog box by clicking the ques- 
tion mark button in the upper left of the box. 


Instant help: Finally, you can get instant help for menu items and 
subitems and for options in dialog boxes by pressing the mouse button 
and then holding the question mark (?) key on your keyboard. Bnef in- 
structions pop up for the thing you are pointing at—a button or check- 
box, a selection list, an icon, a menu, whatever! See “Instant help,” 
below. | 


The Editor menu has an Help item that provides help for using the 
Data Editor. See “Editor help,” below. 


General help For general information about SYSTAT and SYGRAPH procedures, 
use the general help feature in the Apple (@) menu at the far left of the 
menu bar. Select the first irem in the menu, About SYSTAT... 


The following appears. Click the [Help] button. 


SYSTAT 5.0 for Macintosh. ‘ 


Copyright 1989 by SYSTAT, Inc. All J 
rights reserved. 


Leland Wilkinson, Mark Bjerknes, 
Greg Mullins, and Tom McHale 3 


— ed 

[| oK (Help } 

- . | Help | 
ae ST Ter 
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Help Help 


The text window shows an introduction. For further topics, you can do 


one of three things: 


1) Scroll through the vertical list of topics at the right. Click the topic 
you want, and you will see information for that topic. 


2) Scroll through the horizontal list of icons at the bottom. Each icon 
represents an item in the Stats or Graph menu. Click the icon for the 
procedure or type of graph you want, and you will see information for 
that menu item. 


3) Scroll through the text window itself. Topics are organized alphabeti- 
cally. ‘The two selection lists are shortcuts to the section pertinent to the 
item you select. 


il ee A ed situa ees ge 


When you are done using help, you must click the close box on the Help 
window. You cannot resume working until you have closed Help. 


3 Rene = + 


Text list 
The text list shows a list of topics. If you click a topic, the text window 
jumps to its information about the topic. 


Suppose you want information about function plots. Scroll down the list 
until you see the topic, and then click it. The text window jumps to its 
information about function plots. 
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Help —. Help 


You can also locate topics by typing the first few letters; the text list 
jumps to the first topic beginning with those letters. 


SYSTAT Help 


Function 

{ Function lets you plot mathematical functions in two 
and three dimensions. Just specify the equation of 
function you want to plot. SYGRAPH will analyze 
your function, determine axis values, and plot it in = ft: 
smooth lines or surfaces. You do not need to generate fii! 
sample values to produce the plot. fii 


Ts 


Fx. a ry $ 

| a” ot bd a a > 
ee ES a i : : 
ed Tae : “4 Aer 
Jigs tee Factor i Series 


re rre tata te 
ateserte 





{ Icon list , 
| The icon list shows the icons for Stats and Graph menu items, orga- 
3 nized alphabetically, Stats first and then Graph. 


You can scroll through the list to see Graph items. You could get the 
same information about function plots: just click Function, and the text 
window again jumps to its information about function plots. Notice also 
that the vertical list of topics jumps to the corresponding topic. 


er mee Aa ng fp ag et As —lte._- ‘elnlin dlrerretlS maiet ite Rl irri oe 
‘ 
. 
So: *- - 
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Help 


Dialog help 
? 


Help 


Function 

RHEE AEE 

Function lets you plot mathematical functions in two 
and three dimensions. Just specify the equation of 
function you want to plot. SYGRAPH will analyze 
your function, determine axis values, and pict it in 
smooth lines or surfaces. ‘You do not need to generate 
sample values to produce the plot. 


3-D Plot 


“ann 
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Text window 
The topics are organized alphabetically in the text window. Feel free to 
browse-—just use the scroll bars. 


Every dialog box in the Stats and Graph menu and many Data and 
Editor menu dialog boxes (the ones that perform some procedure) have 
a question mark button in the upper left corner. Click the button to re- 
ceive quick general instructions on how to use the dialog box. 
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Help 


For instance, the help button in the Editor/Math... dialog box looks 
like the following: 


® math.. lets you create new variables or 

* transfom existing variables aigebraically. 
Fill in tle Set statement with any combination 
of selerding and typing; click GK to start the 
transfomation. 


Variable Variable or expression 


set [Losses __]to 





By pressing with the question mark key, you can see brief help for any 
menu item or anything in a dialog box. 


Suppose you're staring at the Graph menu and you just can’t remember 
what Plot is for. Just point at the Plot item, press and hold the mouse 
button, and then hold the question mark (?) key on your keyboard. A 
quick reminder pops up: 





You can also get help for any menu subitems. For instance, you can use 


°-click to find out about Plot/Influence: 
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Influence plots show LE eo Ke 2 

size the influence of each point on hae 
the Pearson correlation coefficient 
of the two variables plotted. 
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Contour | Fourier 


= Ry, 
: 
Parallel f Voronoi 


Dialog boxes Use the ?-click technique to get information about items in dialog 


boxes. You can point at any item in the dialog box: a selection list, a but- 
ton, a check box... 















Te “eter 


Suppose you decided to do a scatterplot but you weren’t sure what to do 
with the [Y variables] selection list. Point inside the list, press the mouse 
button and hold it, and then press the question mark key. 


© 1989, SYSTAT, Inc. 35 





36 





~ Help 


Y variables H variable 


STATES STATES 
REGIONS |_| | REGIONS 


DIVISIONS Select one or more variables to 
DIviStON be graphed on the (vertical) Y axis. 


ae O oO A, « we 
Axes |Bubble|Centr'd| Coord | Ellipse 
1 
ee go |S RES 
trror| Fill jLegend} Line | Log 
43 fe | Hk 
Power Smooth] Span 
+ shore 7D 
Ayt | @PA| -c *(- xX 
S‘fir {Symb'l] Title | T‘pose| Vector 


Or, maybe you don’t remember what [Proj’t] is for. Point, press the 
mouse button, and press the question mark. 











Y variables # variable 


STATES STATES 
REGIONS REGIONS 
REGION REGION 
DIVISIONS DIVISIONS 
DIVISION DIVISION 























hat Proj't offers geographic projections 
7A for overlaying plots on maps. 


"Title [T pose] vector 
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Help 


Use ?-click whenever you need a quick reminder. If you need more 
complete help for something, try the help button (the question mark in 
the upper left corner of any dialog box) or the general help facility in the 


Apple menu. 


The Editor menu has an Help item. Use Editor/Help to get instruc- 


tions for using the Data Editor. 


e Select Help from the Editor menu 









The SYSTAT Data Editor creates and edits SYSTAT files. You can enter 
data in three ways with the Editor. You can type in your data from the 
keyboard, paste in your data from the Clipboard, or read your data from 
2 SYSTAT file or 2 text-only file. If you are starting with a new 
worksheet, first enter your variable names. Variable names go in the 
top row, and begin with a letter. Variable names can contain only letters 
or numbers (no blanks). End character variable names with a dollar 

sign ("$"). Do not end numeric variable names with a dollar sign. If you 
are going to read your data from a text file, choose the TEXT button on 
the Open dialog box to see all text files on your disk and select the file 
you want to read. SYSTAT will read the file and create variable names 
for you. 














You can-use the Editor menu, or you can type these commands in the 
Commands window : 











FIND <expression> moves cursor to selected case 








FORMAT <*> sets number of decimals displayed 
SAVE <filename> saves worksheet (or use the File’ menu) 
NEW clears worksheet 

REPEAT <®> fills worksheet with cases 






LET <statement> transforms variables 
IF expression> THEN LET <statement> * 





For further help with the Editor, use the question mark buttons in the 
dialog boxes for specific Editor menu items or use HELP command in 


the Command window. 
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You can get help for the command interface by using the HELP com- 
mand. First, you must open the Command window by selecting Show 
command window from the Goodies menu. For more information 
about the command interface, see the Command reference appendices 
in the SYSTAT and SYGRAPH volumes. Also see the appendix in the 
DATA volume for complete information about the DATA commands. 


You can get both general and specific help by typing HELP alone or 
HELP with a command name , respectively. The command HELP | 
without an argument produces a list of all the commands available in the 
current “module.” If you need help for another module, switch to the 
module first by entering its name and then use the HELP command. 


For example, if you are in CORR and use HELP, you will see the fol- 


lowing: 


HELP 









The CORR module produces sums of products, covariance, and 
correlation matrices on two or more numerical variables. 


CORR commands are: 


& 


















sscP 
COVARIANCE 
PEARSON 

EUCLIDEAN 

GAMMA 

Muz2 

SPE ARMAN 

TAU 

USE, SAVE 

SUBMIT OUTPUT 
NOTE, PAGE, FORMAT 
HELP 

BY, SELECT, WEIGHT 
Quit 





For additional information about these commands, type 
HELP followed by the name of the command. 





You can then get help for a specific command by typing the name of the 


command after HELP. 
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Command help Help 


You can get further help for the PEARSON command, for example, by 
typing HELP PEARSON. 


HELP PEARSON 


Ee a ee | 


The PEARSON command produces a matrix of Pearson product 
moment correlation coefficients. 


Its syntax is: 


PEARSON [<var1>,<var2>,<...> / PAIRWISE | LISTWISE] 


Examples are: 


PEARSON (matrix on all numerical variables in file) 

PE ARSON HEIGHT , WEIGHT 

PEARSON TRIAL(1-5) 

PEARSON / PAIRWISE (pairwise deletion of missing data) 
PEARSON / LISTWISE (listwise deletion of missing data) 
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Command help 


Suppose you are in CORR and need to geth 
get help for the MDS co d 
SCALE. Just switch modules, and then use HELP. — 


MDS 
HELP SCALE 


The SCALE command begins MINDIM to MAXDIM scalings. 
Its syntax is: 

SCALE [<var1 >, <var2>,<...>] 

Examples are: 


SCALE (scale all variables in SYSTAT file) 
SCALE RED , YELLOW ,GREEN BLUE 





You can get help for SYGRAPH by switching to the GRAPH “module” 
and then typing HELP or HELP command, where command is the com- 
mand for which you need information. 


You can type HELP anywhere in SYSTAT or SYGRAPH. Don’t hesi- 


tate to use it. You can also take advantage of the NAMES and 
OPTIONS commands whenever you need to remember the variables, 
files, and options you are using. 
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| Information 

i ee 

| Information from the Goodies menu gives information about any term 
or statistic you select. 


: 


¢ Select a term or statistic in the Analysis window output (use the 
mouse to highlight the text) 
¢ Select Information from the Goodies menu (or type Command-l) 


For example, Stats/Statistics... might produce some output like this: 


Descriptive Statistics 


Total observations: 50 


PNEU_FLU 
N of cases 50 
Mintmum 12.400 
Maximum 32.400 
Mean 21.038 


Standard dev 4.140 


wt 


Can’t remember what a standard deviation is? Select “Standard dev’ in 
the analysis window, and use Information. 


& File Edit Data Graph Stats Mirmonitte Editor 


Wabatimortshaneda| 


Descriptive Statistics Show view window 
Hide command window 


Total observations: 50 
PNEU_FLU Reda last analysis 
Manie 
ious Piot tools and colors 
32.400 Show options in effect 
Pay Record preferences 
Hefine bundles... 


Transfer to... 








Help Information 


information 


Standard deviation, a measure of spread, is the square root of the 
mean squared deviation of values from the mean. Data sampled from 
a standard norma) distribution have a standard deviation of one. 


For series results, the standard deviation of the series is displayed. 
Standard deviation is the square root of the variance of the 
series data (the variable). 
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Introduction 
Variables 
Cases 
Worksheet 
Numeric and character variables 


Entering data 
Open the Data Editor 
Maximum file size 
Enter data 
Variable names 
Values 
Save data 


Editing data 
Changing variable names 
Changing data entries 
Deleting cases and variables 
Changing blocks of data 
Selecting blocks 
Copy, Cut, and Paste ~ 


Transforming data 
Math... 
Recode... 
Operators and functions 
Order of expression evaluation 
Fill worksheet... 
Filling variables with random data 


Data Editor utilities 

Find case... 
Find next 

Editor/Formats... 
Decimals to show 
Enter right/down 
Matrix type 

Help 

Select... 
Extract selected... 
Remove select 
Other ways to select data 


Special data structures 
Grouping variables 
Aggregated data 
Triangular matrices 
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Data Editor 


46 
46 
46 
46 
47 


48 
48 
50 
50 
51 
53 
56 


Be, 
ayy, 


© 1989, SYSTAT, Inc. 











' Data Editor 4 





Overview You can get data into SYSTAT four ways: 


1) You can use the SYSTAT Data Editor 

2) You can Copy or Cut data from another application 
3) You can import text files from other applications 

4) You can use SYSTAT’s powerful DATA facilities 


This chapter discusses the first two methods. It also shows how to edit 
data, transform variables, and search for cases. 


See the next chapter, Data files, for instructions on importung. 
See the DATA volume for information on the DATA command lan- 


guage for advanced data manipulations. You can also use DATA to in- 
put data from the keyboard (if you prefer) or to read fixed format ASCII 


ite _ files imported from mainframe applications. 
This chapter combines a tutorial with reference information on the 
features of the Data Editor. We mark the tutorial with sidebars, so if 
you prefer to experiment first and read details later, you can follow the 
tutorial as it winds through the chapter. 

ide ‘ 
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Introduction | 

I 
A data set is a collection of information. It contains various characteris- 
tics of a set of objects: the age, weight, and height of a bunch of people. 
The high, low, and closing prices of a group of stocks. The colors, fla- 
vors, and prices of a dozen soft drinks. The beginning, middle, and 
ending weights of patients at a diet clinic. The number of strikes, fouls, 
bases, and runs of baseball players. 


For statistical analysis, we usually arrange this information in rows and 
columns. The rows comprise cases, or the objects we have measured. 
The columns comprise variables, or the characteristics on which we 
have measured the objects. 


Variables These characteristics—ages, prices, weights—are variables. A variable is 
a measure on which cases are assessed or an attribute describing them. 


Cases A case is one of the objects on which measurements are taken—one of 
the people, one of the stocks, one of the soft drinks, one of the baseball 
players. 

Worksheet A worksheet stores cases and variables in a rows-by-columns format. Say, 


for example, you are doing a medical study. You have a sample of 10 

volunteers and are testing some new medications on them. You'll want j 
to record some basic information about them: height, weight, age, sex, 

etc. You also need some way of distinguishing each volunteer—like 

name or chart number. : 


You might write all this information in a table like this one: 7 
Name Height Weight Age Sex ; 
Mildred 5110" 140 33 F g 
Gertrude 62" 210 22 F 
Clarence 92" 110 26 M 
Melvin s'11" 160 42 M 


Here, the Name, Height, Weight, Age, and Sex columns are all vari- 
ables. The Mildred, Gertrude, Clarence, and Melvin rows are all cases. 
Note that the entire row devoted to Mildred’s characteristics (including 
her name) is one case. The entire column of names is one variable. 
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Introduction Data Editor 


You must set up data files like this for SYSTAT: with variables set up in 
vertical columns, and cases in horizontal rows. A rectangular arrange- 
ment of data like this one, with variables in columns and cases in rows, is 
called a worksheet. 


=" Numeric and A variable can be numeric or character; that is, it can contain observations 
character that are numbers (numeric) or that are some combination of letters, 
variables symbols, and numerals (character). In SYSTAT files, numeric variables 


contain decimal numbers only. 


Bea aes ft 


In our dataset, Weight and Age are numeric variables, because the val- 
a 3 ues in those columns are numbers. Name and Sex are character vari- 
=P ables, because their values are letters. 





The tricky variable is Height: is it numeric or character? The values 
contain numerals, but they also contain the’ and ” symbols for feet and 
inches, and they contain two fields: feet and inches. Numbers can only 
contain digits and an optional decimal point. Therefore, Height is a 
character variable. 


You could change Height to make it a numeric variable: you could show 
Height by number of inches, or you could use the metric system (which 
uses only digits before and after a decimal point with a single unit, me- 
ters). Or, you could convert inches to tenths of feet. 


AEE RS OSE TRE prreree: 
“te pe eee A 
a ts ~ 4 - hig a at 


: a feet'inches’ inches . Meters feet. tenths 
— 5'10" 70 1.778 5.833 
— | 62" 84 2.134 6.167 
5'2" 62 1.575 5.167 

s111° 71 1.803 5.917 


a All but the first column could be numeric variables for SYSTAT. 
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Entering data 


Open the Data 
Egitor 
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You enter data for SYSTAT by typing the worksheet into a Data 
Editor. You can open the Data Editor window two ways. You can open 
a preexisting file in the Data Editor by selecung Open... from the File 
menu and clicking [Edit] before clicking Open. 


&) Data Files 
CD DATAFILE > 


CO EKMAN 
1 ENERGY 
0 ENZYME 
0 EXPEND 
D FADE 

O FOSSILS 
O FRACTION 
CQ GROWTH 


@® SYSTAT © PICT © DLOG 


You can open an empty Data Editor window by selecting New from the 
File menu. Let’s do that now. (You haven’t started SYSTAT yet? Just 
double-click the SYSTAT application icon.) 
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Entering data Data Editor 


e Select New from the File menu 


Open... 
Import... 
Lfose 


Save #4 
Save as... 

Save selecied tent... 
Page setup... 

Paye preule ts... 

Print selected tert... 
Print geoph... 


Submit command file... 
Submit clipboard 





= 3 The Data Editor window is like any other window: you can move It, re- 
wy size it, close it, and scroll around in it. You can Cut, Copy, and Paste to 
3 and from other windows and applications. 


i The Data Editor allows you to enter data into a worksheet, view data, 
edit data, and save data. Note, however, that the Data Editor is not a 
spreadsheet. It looks like a spreadsheet, but it differs from a spreadsheet 
in a number of important ways. 
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Data Editor Entering data 


The Data Editor is not a spreadsheet 
First, you cannot store formulas in individual cells. You can use the 
Editor menu to do mathematical transformations, but the Editor only 
saves the results of the transformations—not the formula for making 


them. 


For example: say you had entered two variables, A and B. Now you want 
a third column, A+B. Some spreadsheets would let you label the third 
column with the formula A+B, and the sums would fill the column. 
Then, if you changed the value in the third case for A, the third case for 
the A+B variable would automatically change to the new sum. 











SYSTAT does not work this way. You would use Math from the Editor 
menu to “Set C to A+B,” for instance, and C would be filled with the 
correct sums. However, if you then changed A and B values, C would 
not change. You would have to “Set C to A + B” again to replace the old 
sums with the updated sums. 


Second, the Data Editor does not let you insert rows and columns like ! 
spreadsheets do. You can add variables and cases, but they are always 
placed after the last variable or column. 


Finally, the Editor can handle as large a file as you can fiton your disk, 
while most spreadsheets are limited to a few thousand cells. It keeps the 
values in a file, notin memory; therefore, it is not limited by memory. 
Because of this, the Editor is somewhat slower than spreadsheets that 
work in memory. 


Maximum file. . Your worksheet can contain up to 256 variables and as many cases as 

size you can fit on your hard disk. If you had 20 megabytes of disk space 
available, you could store approximately 10,000 cases with 250 variables, 
which would make the worksheet 22 feet wide by 130 feet long. You 
could make a worksheet just as wide and a mile long if you had about 
820 Mb of disk space. SYSTAT can handle it. 


Enter data To enter data into the worksheet, just select a cell and type in it. You 
can select a cell by clicking in it or by moving the cursor to it using cur- 
sor arrow keys. When you are done typing, press Return, Tab, or Enter. 
The cursor automatically jumps one cell to the right. 
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Entering data 


Variable names 
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Data Editor 


(You may change things so that the cursor jumps down instead of to the 
right, if you prefer. Select Formats... from the Editor menu and click 
[Enter down]. If you want to make [Enter down] permanent, use 
Record preferences from the Goodies menu.) 


First you must enter variable names: labels for each of the columns you 
want to use. In other packages, variable names are sometimes called 
variable labels, column headings, or field headings. Variables names go 
in the top row of the worksheet. 


Variable names can be up to eight characters (letters or numbers) long 
and must begin with a letter. You may only use letters and numbers—no 
symbols or punctuation marks. You may, however, use the underscore 
(_) character to represent a space, e.g. NEW_YORK. 


You may type variable names in UPPER- or lower-case, but they are 
stored in UPPER-CASE. 


We make these restrictions on variable names to ensure latera] compat- 
bility: any SYSTAT data file or command file will work with SYSTAT 
on any machine—VAX, IBM-PC/compatible, UNIX, etc. (You do 
usually need to convert the file, but you won’t ever have to rename vari- 
ables and edit your command files.) | 


Character variables 


Character variables (variables with values that are not numeric) must 
have a dollar sign ($) as their final character. The dollar sign does not 
count toward the eight character limit. 


Note: you can change variable names in the Editor, but you may not 
change a character variable to numeric or vice versa. Thus, if you 
change the name of a character variable, the new name must also end 
with a dollar sign. | 


Subscripts 


You may subscript numeric variables by placing the subscript numeral 
inside parentheses at the end of the variable name, e.g. X(1), X(2), ..., 
X(n). Subscripts are helpful because you could, for example, type 
STATS X(1-5) instead of selecung X(1), X(2), X(3), X(4), and X(5) in- 
dividually. 
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Data Editor 


Entering data 


You may not subscript character variables. If a variable name has no $ a 
the end, SYSTAT assumes it to be numeric. 


A set of subscripted variables is someumes called an array. The number 
of variables in an array is its dimension. F or example, the set of variables 


JUNK( 1), JUNK(2), JUNK(3) is a three dimensional array, 


Entering names 
To enter a variable name, select a cell and type the name and press 
Return, Enter, or Tab. The cursor jumps to the next cell to the right, 
and you can type the next variable name. 





Try it out! Notice that the first cell of the worksheet is already selected. 
Type the first variable name, TEAMS, using either upper- or lower-case 
letters. Don’t forget the dollar sign, 


SYSTAT Data Editor 


| 


Pomme me Pees 





Now press return. The variable name is centered in upper-case, and the 
cursor jumps to the right: 


sYSTAT Data Edito 
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Entering data Data Editor 


Now enter the second variable name, WINS. This is going to be nu- 


meric, so don’t use a dollar sign. Don’t forget to press Return (or Enter, 
or Tab). 


SYSTAT Data Editor 





Values After you have entered variable names, you can enter values. 

$ Numeric values 

bd You can enter numeric values as integers or real numbers, with or with- 
ee out decimal places. 


ze You can also use scientific notation. Exponents can have absolute 
“% magnitude less than 36. (That is, you can have values as large as 910° 
and as smal] as 1*10~**.) Standard exponential shift notation applies: 


= 2.9E5 = 2.9*10° = 290,000 
| 2.9E-5 = 2.9107 = .000029 


Thus, all of the following are valid numeric data values: - 


Lae ete” Bet tg ratals  oie 
1 


aa 13579 

. & 1.0£15 

; a2E=5 

— 123456789.1234 
t ¥ 1. 5e-12 

-_ 2e25 

_ 


lle PL 2 sta ape tet 
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Data Editor Entering data 


Character values 
Character values may be up to 12 characters long. You may use upper- 
and lower-case; unlike variable names, though, character values are case- 
sensitive. That is, if you type lower-case characters, they remain lower- 
case. 


So rele. a rh ie, ng gee 


If you need to use an apostrophe or single quotation marks as part of a 

character value, surround the whole value with double quotation marks. 

If you need double quotation marks as part of a value, surround the 

whole string with single quotation marks. You will not see the surround- : 
ing quotation marks in the stored value. j 


4 
You can enter numeric values for a character variable if you surround . 
the value with single or double quotation marks. SYSTAT treats such 

values as text; you cannot do numerical analyses with them. - 
All of the following are valid character values: 7 
San Francisco | 
“Chuck’s Place” B 
Three 5 
‘the “spot”’ r 
Missing values ; 
Suppose that you have a hole in your dataset. For example, Gertrude " 
might refuse to divulge her weight. Or, perhaps a stock didn’t get sold Rs 
one day, or maybe sareeets fess to record its low price for the day. m 
Perhaps one of the soft drinks is a test product that hasn’t been named . 

yet. - 


These would all be missing values. You can’t record a value in the table 
because you don’t have one. You must, however, do something about 


that gap. On your scratch pad, you would probably just leave a spot = 
blank, or maybe you’d make a hashmark to remind yourself that you a 
didn’t have a value (and that you didn’t just skip that spot accidentally ig 
when you were writing things down). - : 
Enter a decimal point (period) by itself for a missing numeric value. + 


Enter a blank space (press the Space bar and then Return) for a missing a 
character value. If you don’t enter one of these values and you leave a 
“hole” in your worksheet, the Editor will jl] missing numeric values 
with a decimal point anyway. 


an 
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Entering data 


Data Editor 


Let’s enter some values. Select the cell immediately below TEAMS by 
clicking it or by moving to it with the cursor arrow keys. 


SYSTAT Data Editor 





Type the first team name, Tigers. The value is stored and the cursor 
jumps to the nght. Now enter the first numeric value, 98. After you 
press return, the Editor should look like this: 


SYSTAT Data Editor 





Notce that the Editor is now ready for another TEAMS name. Since 
you had filled the enure first case, SYSTAT assumed (correctly) that the 


next thing we would want to do is enter another team name. 
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Data Editor 


Saving data 
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Entering data 


Enter some more data: 


SYSTAT Date Editor 


Indians | 100.000: 5; 





Notice that the Data Editor numbers your cases in the first column of 
the worksheet. In fact, CASE is a variable thar is automatically present 
in any file. 


Two items in the File menu let you save data: Save and Save as... The 
first, Save, saves the data in the current filename or, if it is a completely 
new dataset, SYSTAT will ask you for a filename. 


Say, for example, you had opened an existing file named TESTDATA 
and changed some values. If you then used Save, you would lose the old 
values which would be replaced by the new ones. Suppose you then dis- 
covered that your changes were all wrong and you wanted the old values 
back. It would be too late. If you had used Save as... instead and stored 
the new values in another file, such as TESTDATA2Z, all you would 


need to do is open the old file. 
Let’s save our data before we go any further. You never know when the 


power is going to fail or you're going to spill coffee on your keyboard. 
(Your keyboard may not survive, but at least you won’t lose any data!) 
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Entering data Data Editor 


Just select Save as... from the File menu. 


New 

Open... 

[mport... 

{iose 

Save 

Save as... 

Save selected text... 


Page setup... 

Page preview... 

Print selected text... 
— Print graph... 


Submit command file... 
Submit clipboard 36 F 


Quit : #0 





A dialog box asks for a filename. Type a name and click Save. 


" e re : 
: 4 C Data Files — Disk 


Save file as 


baseball | 
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Data Editor Entering data : 


—— ae 


SYSTAT reports that it has saved: 


eer tt oe oe 


7 cases saved. 


4 
é 
Now thar you have given a filename, you can use Save... from the File 
menu to save any changes or additions you might make. Save... saves | 
the data in the current file. : 


gt 
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Editing data 





Changing 
variable names 


Changing data 
entries 


As you have already seen, the Data Editor lets you change data values 
and variable names. Editing is the same as entering: click the cell, type 
the new value, and press Return. The old value disappears and the new 
one is stored. 


Changing values in the Editor does not change the values in saved files. 
To change what is stored on your disk, use Save or Save as... from the 
File menu. 


You can change a variables’ name by clicking its cell and typing a new 
name. However, you cannot change a vanable from numenic to charac- 
ter or vice versa. If you try, SYSTAT gives you an error message and 
leaves the name unchanged. 


You should be careful to specify each variable’s rype correctly (by using 
a dollar sign $ for character variable names). If you make a mistake, you 
will have to Delete the variable and add a completely new variable to 
the right of the last variable in the worksheet. (We'll show you how to 
delete variables and cases soon.) 


Change data values the same way you change variable names: click the 
cell and type over the incorrect value. When you press Enter, the new 
value is stored and the old orte disappears forever. (You cannot use 
Undo to change a cell back. Youll have to type it again.) 


You probably noticed that we made a few mistakes in the baseball data. 
Click the “Yankes” cell and type the correct name, “Yankees.” (You 
don’t need to use the quotation marks.) Press rerurn when you're done. 


SYSTAT Data Editor 


91.000! 


78.000! 
100.000 





Data Editor 


Deleting cases 
and variables 


Changin 
blocks OF data 


Selecting blocks 


60 





Editing data 


The second mistake is more glaring. The Cleveland Indians? One hun- 
dred wins?! Click that cell and enter the correct total, 60. 


SYSTAT Date Editor 
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To delete an entire row or column from the worksheet, use Delete. 


e Select a row or column by double-clicking in its case number cell or 
variable name cell 
e Select Delete from the Editor menu 


You cannot undo Delete. 


You can select large blocks of data. Then, you can cut or copy the cells 
to the Clipboard and paste them into a new location of the worksheet, 
or in a different worksheet, or in a different window (like the Analysis 

window, that shows the results of statistical analyses), or in a different , 
application. 


ae 


Selecting rows 


To select an entire row, just double-click in the cell containing that 
row’s case number. 


Disk:SYSTAT: baseball 
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Editing data Data Editor 


Selecting columns 
To select an enure column, double click the cell containing the variable 
name. 


Disk:SY¥YSTRIT: baseball 





Selecting blocks 
To select a block of data, click the cell in one corner of the block you 
want, drag the pointer to the diagonally opposite corner of the block, 


a and release the mouse button. 
= . ESS Disk:SYSTAT: baseball 





If the diagonally opposite comer is out of view, just drag outside the 
window, and the window will automatically scroll. Move the pointer 
back inside the window when you've scrolled far enough. 


og uetee pate 


q To select really large blocks of cells, click one corner cell, release the 
F mouse button, and move to the diagonally opposite corner cell. (You 


i a 


4 may use the scroll bars as necessary.) Then, hold the Shift key and click 
4 the cell. You can add still more to your block - Shift-clicking in an- 
4 other cell. 
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Data Editor Editing data 


Note: you can only select rectangular blocks of cells. You cannot build 
irregular shapes by Shift-clicking; SYSTAT will always force your se- 
lection remain a rectangular block. 


Selecting the entire window 
You can select all of the data by selecting Select all from the Edit 


menu... 


Hide Clipboard 
Select All +A 





Copy, Cut, and Once you have selected a range of data, you can use Copy or Cut from 
Paste the Edit menu to place it in the Clipboard. 


Note: the following techniques also apply to text in the Command win- 
dow and the Analysis output window. 


Cut 
You can cut a cell or range of cells from the worksheet. The cut data are 
removed from the worksheet and placed temporarily in the Clipboard (a 
storage place that carries from one file or one program to another, sav- 
ing the single most recent cutting). 


e Select a range of cells 
e Select Cut from the Edit menu 


Clear 
Clear works the same as Cut except that the selected material is not sent 
to the Clipboard. Since nothing goes to the Clipboard, the contents of 


the Clipboard are unharmed, just as when you remove material using 
the Delete key. 


Copy 
Copying works like cutting, except that the selected data are not re- 
moved from the worksheet. A copy is placed in the Clipboard. 
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Editing data Data Editor 


e Select a range of cells 
e Select Copy from the Edit menu 


Cut and Copy both replace the previous contents of the Clipboard with 
new material. 


You can display the Clipboard by selecting Show Clipboard from the 
Edit menu. 


Paste 
You may use Paste to place whatever is currently in the Clipboard 
somewhere else in the Data Editor (or in another window). 


spay 


e Click the upper left corner of the area where you want the data 
e Select Paste from the Edit menu 








| If you want, you can ignore the material in the Clipboard and consider 

= it thrown away. 
Fa 

. Let’s try cutting some cells from the baseball data and pasting them 
ay back. 

. 3 Ss Pisk:S STAT: baseball 

ere. Meee lil es i SLO 

x | ED eaehet POL 

- Ener sae 

ee nn a VOR OOS | 
Bg Red Sox? 
. it indians : 
a 

ane 

—— e Select the range as shown above 

. ¥ e Select Cut from the Edit menu 
R 4 
¥ 
4 

3 “a 
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Data Editor Editing data 


SYSTAT fills the area with missing values: 
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Now, paste the data back in. 


e Click the upper left cell in the empty area 
_@ Select Paste from the Edit menu 
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We're back where we started. 
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Transforming data 


We said earlier that the SYSTAT Data Editor is not a spreadsheet. 
However, the Editor menu lets you do arithmetic with columns. 
Math... and Recode... let you do algebraic transformations. You can 
make numeric versions of character variables and vice versa. Finally, you 
can use Fill worksheet... to generate random values for variables. 


Math... You can do algebraic transformations of data with Math... You can de- 
fine (or redefine) one variable in terms of other variables and constants, 
using any of the operators, functions, and relations, which are discussed 
in the next sections. The dialog box presents a blank Set statement that 
you can fill in with any mathematically valid combination of variables, 
numbers, functions, and operators. 


You can select variables and functions from the selection lists and type 


- when necessary. Be sure to place the arguments of functions inside their 
ag parentheses. You can use the Tab key or the mouse to move from box to 
S - box. 
=. In baseball there are 162 games in a season. In our file, the variable 
a WINS contains the number of wins for each team in the Fastern 
Re Division of the American League for the 1986-87 baseball season. To 
ee enter the number of losses for each team we could name a new variable 
Ty, called LOSSES, calculate the number of losses by hand, and enter them 


cell by cell. Or, we could use Math... 


ae ¢ Select Math... from the Editor menu 
ame e Fill in the Set statement: type LOSSES in the first box, click in the 


a second box and type 162—WINS 
= @ Click OK 
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Recode... 
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Transforming data 


Variabie Variable or expression 





Watch the lower left corner of the Editor window. An odometer shows 
you the pace of SYSTAT’s calculations. When the calculations are 
done, the new variable appears to the right of the last variable in the 
Editor. 


Disk:SYSTAT: baseball 





Recode... lets you define variables in terms of others conditionally with 
if-then statements. The dialog box presents a blank if-then statement 
which you can fill in with any mathematically valid expressions. 


Let’s make a character variable showing which teams have extreme win- 
loss records. We want to make a LEVELS variable that says ‘Extreme’ 
for any team that has won or lost more than 9) games. For this, we need 
to use the logical operator [Or]. Note, however, that [Or] is not avail- 
able in the Recode... dialog box: 
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Transforming data Data Editor 










Test uarinsble @= O~»> Variable or expression 
nt > O~ 
O<« Oo 
Variable Variable or expression 
then t 
0 
set 









If you need to use [And] or [Or] in your test condition (your “if” clause), 
Z click [Complex recode...]. A new dialog box comes up, and we’re ready 
- to go. 


e Select Recode... from the Editor menu 

@ Click [Complex recode] 
: e Put together the test condition by selecting and typing: 
ae e Build the first condition: WINS>95 

) © Click [Or] 

e4 | @ Build the second condition: LOSSES>95 
5 





e Put together the “then” clause: 

e Type LEVEL$ in the second box 

e Type 'EXTREME’ in the third box; don’t forget the quotation marks 
e Click OK 


ek 
4 
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Data Editor Transforming data 


Watch how the box in the lower right corner of the dialog box stores 
the parts of your test condition. You can keep clicking [And] and/or 
(Or] (try reading that ten times fast!) to add more condiuons. 


TEAMS 


WINS 
LOSSES 


Test expression (And) 
«ones 


Variable Veriable or expression 
(eves __|tofentreme} 


WINS?95 Or 





Again, we see the odometer and then a new variable appears. 


+ TEAMS} __WINS____}_ LOSSES resiles 
Tigers: 64.000: 
= aoe emremnemmnenenmrris qn “4 





Operators and __ Following are the operators, relations, and functions available with 
functions Math... and Recode... 


ee 
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Transforming data Data Editor 


Arithmetic operators 
Arithmetic operators tell SYSTAT to perform a mathematical calcula- 
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tion between two numeric expressions. 


> #1 4 


addition 
subtraction 
multiplication 
division 
exponentiation 


Here are examples of expressions you could use inside Set and If-then 
statements: 


2*3 
X-2 
A/B 


AGE*2 


AGE“BEAUTY 


SYSTAT sets the resulting values of inadmissible operations (e.g. divi- 
sion by zero) to missing. 


Functions 


Functions modify the variable, number, or expression you place inside 
the parentheses. 


SQR 
LAG 
LOG 
EXP 
ABS 
SIN 
Cos 
TAN 
ASN 
ACS 
ATN 
ATH 
INT 
LGM 


square root 

lags variable by shifting values down one row 
natural logarithm 

exponential function 

absolute value 

sine 

cosine 

tangent 

arcsine 

arccosine 

arctangent 

arc hyperbolic tangent (Fisher’s z) 
integer truncation 

log gamma 

unary minus (negative) 
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Data Editor Transforming Cata 


Here are examples of expressions you could use inside Set and [f-then 
statements: 


SQOR(3) 
INT(CARDIO+CANCER) 
SIN(COS( TAN(3*Y) )) 


Arguments of trigonometric functions must be in radians, not degrees. 
LOG is the natural logarithm. You can obtain logs in other bases by di- 
viding the natural log by the log of the base, e.g.: 


LOG(X)/LOG(10) 
LOG( INFO)/LOG(2) 


SYSTAT sets the resulting values of inadmissible operatons (e.g., 
square root of a negative number) to missing. 


Relational operators 
Relational operators are used in relational expressions to compare either 
two numeric or two character expressions. 


less than 

equal to 

greater than 
> not equal to 
<= less than or equal to 
=> greater than or equal to 


AV IA 


Here are examples of expressions you could use inside Set and If-then 
statements: 


AGE>BEAUTY 

X <>. 

GROUP >= 2 
400<(CANCER+CARDIO) 
STATES="NY° 


Logical operators 
The logical operators [And] and [Or] connect two relational expressions 
to make a complex relational expression. 


ee ee 
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Here are examples of expressions you could use inside Set and If-then 
Statements: 


AGE>17 And AGE<=60 
SEX$="MALE’ Or WEIGHT>300 
(GROUP>5 Or SEX=1) And 400<(CARDIO+CANCER) 


The logical operator [Not] reverses the value of an entire expression. 
[Not] sets any nonzero value (true) to zero, and any zero value (false) to 
one (true). For example: 


If Not (PRICE<5000 And MAXSPEED>100 And PASSGRD>4) 
then set PURCHASE$ to ‘FALSE’ 


Note thar even if you are testing for different values on the same vari- 
able you must specify a separate relational expression for each value and 
join those expressions with logical operators. In the first example, 
AGE>17 And AGE<=60 is correct. The following, however, would not 
work: 


AGED17 Or <=60 
60>=AGE>17 


Some examples of legal statements using [And] and [Or] are: 


If REGION=3 And CARDIO>420 then set RATE$ to ‘HIGH’ 
Find CARDIO>430 Or CARDIO<400 And REGIONS="WEST"™ 


(Instrucnons for Find... appear in the “Data Editor utilities” section of 
this chapter.) 


You can assign the results of a relational expression to a numeric van- , 
able. The variable receives a value of one if the expression is true; | 
otherwise it receives a value of zero (false). 
Set TRUTH to A > 0 Or B < 100 

In this example, any case where A is greater than 0 or B is less than 100 


will have value 1. Any case where A is less than 0 and B is greater than 
100 will have value 0. 
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(Note: SYSTAT follows the standard for programming languages and 
returns 0 for false and 1 for true. Microsoft Basic 1s nonstandard and 


returns 0 for false and —1 for true.) 


You may not use logical operators to join two “then” statements. The 
following statement using And and Or is not possible: 


If GROUP=1 then set TEST to l And set TRIAL to 2 


Multi-variable functions 


Cross-variable functions, unlike other SYGRAPH operators, take only 
variables as arguments. They are also unique in that they use only 
nonmissing values in their calculations and exclude any variables that are 


missing. 


AVG mean of nonmissing values 

SUM sum of nonmissing values 

MIN — minimum value of values 

MAX maximum value of values 

STD __ standard deviation of nonmissing values 
MIS number of missing values — 


If we wanted to find the mean of the three variables A, B, and C for each 


case, we could use either of the statements: 


Set MEAN to (A+B+C)/3 
Set MEAN to AVG(A,8.C) 


In the first statement, MEAN is set to missing when any of the three 
variables is missing. In the second statement, AVG returns the mean of 
the nonmissing variables. MEAN is missing only if all three are missing. 


You can use subscripts if you want to sum the values in an array or set of 


contiguous variables: 


Set TOTAL to SUM(X(1-20)) 
Set TOTAL to SUM(A-Z) 
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Distribution functions 


There are three kinds of distribution functions: cumulative, inverse, and 
random variate. 


Distribution Cumulative Inverse Random data 
Uniform UCF(x) UIF(a) URN 

Normal ZCF(z) ZIF(ax) ZRN 

t TCF(t, df) TIF(a, df) TRN(df) 

F FCF(F,df1,df2) FIF(a,df1,df2) FRN(df?,df2) 
Chi-square  XCF(y?, df XIF(a, df) XRN(df) 
Exponential ECF(x) EIF(cx) ERN 

Gamma GCF(y,p) GIF (ax, p) GRN(p) 

Beta BCF(B,p,q) BIF(a., p,q) BRN(p,q) 


Cumulative functions compute the probability that a random value from 
the specified distribution falls below the given value (i.e. the proportion 
of the dismbution below that value). Inverse functions take a specified o 
(a probability value between zero and one) and return the critical value 
below which lies that proportion of the specified distribution. Random 
variate functions generate pseudo-random variates from the specified 
distribution. 


The algorithms for all functions were chosen to favor numerical accu- 
racy over speed. Some functions may take time to evaluate for some 
choices of parameters. Please be patient. 


SYGRAPH supplies eight distributions: uniform, normal, t, F, chi- 
square, exponenual, gamma, and beta. Uniform distribution refers to a 
distribution uniformly distributed from zero to one. Norma! distribu- 
on is the standard normal distribution N~(0,1) with mean zero and 
standard deviation one. Exponential has parameter one. All the others 
require you to specify their parameters. These parameters may be non- 
integer, 

Use the cumulative distribution functions to obtain probabilities asso- 
ciated with observed sample staustics. For many applications, to obuin 
“significance levels” from the FCF or XCF functions you must find the 
upper tail probabilities, e.g.: 


Set P to 1 - FCF(4.14,3,7) 
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Transforming data 


You can use the functions we provide to evaluate probabiliues for distri- 
butions for which there are no explicit cumulative distribution functions. 
For example, you can evaluate the cumulative binomial probabulty of k 
or more occurrences in 2 trials with binomial probability p using the 
BCF function: 

Set BINPROB to BCFC(P,K,N-K+1 ) 


Using the GCF function, you can evaluate the cumulative Poisson prob- 
ability of the number of random events between zero and & when the 
expected value is .X: 

Set POISSONP to 1 - GCF(X,K+1) 


Use the inverse distributions to determine critical values and to con- 
struct confidence intervals. They are also handy for power calculations. 


You can use transformations of the random variate functions to produce 
random deviates of the desired forms. To produce a random normal de- 
viate with the required mean and standard deviation use one like: 

set NORMAL to MEAN+STDDEV*ZRN 

To produce a uniform deviate between A and B use an expression like: 
Set UNIFRMAB to A+(B-A)*URN 

To get an exponential variate with parameter lambda, use: 

Set EXP to ERN/LAMBDA 

To get an extreme value variate with location alpha and scale beta, use: 
Set EXTREME to ALPHA - BETA*LOG(ERN) 

To get a gamma variate with shape alpha and scale beta, use: 

Set GAMMA to BETA*GRNCALPHA) 


If ALPHA is an integer, then the above expression returns an Erlang 
variate. 
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Data Editor 


Random number generators 


The algorithm for uniform variates generates the numbers by a triple 
modulo method. Each uniform is constructed from three multiplicative 
congruential generators with prime modulus. The initial seeds for each 
generator are 13579, 12345 and 313 (Wichman and Hill, 1982). You can 
modify the last seed of this generator by using the RSEED command. 
Select Show command window... from the Goodies menu and enter 
the command: 


RSEED=seed 


where seed is an integer between | and 30,000. Press Return to execute 
the command. 


S¥STRT Command 





The pseudo-random variates for the other distribution functions are 
generated by applying the appropriate inverse cumulative distribuuon 
function to uniform random variates between zero and one. 


Note: if you have been using SYSTAT Version 3.1 or earlier, note that 
the names of the SYSTAT distribution functions have been changed. 


Old name New name 
URAN ¥#£UURN 
NRAN NRN 

CDF ZCF 

IDF ZIiF 


75 


Data Editor Transforming data 


Order of Expressions are evaluated from left to right according to the precedence 
expression of operators. That is, operators with higher precedence are evaluated 
evaluation before those with lower. Order of precedence from highest to lowest : 


runs as follows: 


1) Expressions enclosed in parentheses () 
2) Exponentiation A 
3) Unary minus 


4) Multiplication and division Fe 

5) Addition and subtraction +, - 

6) Relational operators =, <>, <, >, <=, >= 
7) Logical operators And, Or 

8) Logical negation Not 


This precedence is the same as in the BASIC programming language 
(which is discussed in DATA). Because of this order of precedence, the 


expression 

A+B+C 
is the same as 

A+(B*C) 
and not the same as 

(A+B)*C 


Muluplication (*) has a higher precedence than addition (+) and is 
therefore performed first. If A=1, B=2, and C=3, the above a 
evaluate as follows: 


A+BeC= 142-3 =146=7 
A+(B+C) = 14+(2-3) = 1+6=7 


wid tumae Ft 


(AsB)*C = (14+2)-3 = 3-3 =9 
Fill Fill worksheet... from the Editor menu fills a specified number of ] 
worksheet... cases with missing values. : 
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with random data 


ae 


E.© 1989, SYSTAT, Inc. 


You must first name the variables you want to fill. 


e Select Fill worksheet... from the Editor menu 
e Specify the number of cases to fill 
e Click OK 





Fill worksheet with rows. 






There are 0 rows in worksheet. 


Fill worksheet to rows. 


Data Editor 


Once you've filled a variable or set of variables with missing values, you 
can use Math... to fill the variables with random values using the URN, 


above. 


ZRN, TRN, FRN, XIF, ERN, GRN, and BRN functions discussed 


For example, here we fill the variable A with normal random data. 


e Select Math... from the Editor menu 
e Fill in the statement: Set A to ZRN 
e Click OK 
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Find case... 


Find next 
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You can search your data for values meeting specific conditions with 
Find case... from the Editor menu. 


Test variabie ©O= Variable or expression 
Fingwins | @> Oc 
O< Oo 





You can use the functions, operators, and relations discussed above for 
your Find statement. If you need to use the logical operators [And], 
[Or], or [Not], click [Complex find]. 


Find... starts its search fromt the current case (wherever the cursor is 
resting) and goes down the file. When SYSTAT finds a case that meets 
your condition, it moves the cursor to that case in the (first) variable 
named in your Find statement. 


After a Find... is done, Find next becomes available in the Editor 
menu. Find next executes a new search on the same Find statement. 


Try using Find case... on the USDATA dataset. 
e Select Open... from the File menu 
e Select the USDATA dataset from the Data Files folder 


@ Click [Edit] 
e Click Open 
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Data Editor utilities Data Editor 


The cursor should be resting in the top row. Try searching for 
Michigan. 


¢ Select Find case... from the Editor menu 
e Fill in the statement: Find STATES = ‘MI 
® Click OK 


The cursor moves to the STATES column on case 13. 


| Disk:SYSTAT:USDATA 
= iREGION$ | REGION {DIVISIONS —__ 
ji 2.000; EN Central fF: 
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’ Now, move the cursor back to the top of the file and do another search. 


Select Find case... from the Editor menu 
e Fill in the statement: Find REGION = ? 
® Click OK 
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Now, repeat the search. 
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‘ e Select Find next from the Editor menu 
‘ 4 The-next case with REGIONS? is case 11. 


i 
a 
4 
t 
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Now, let’s try a complex search. Move the cursor back to the top of the 
#. 
e Select Find case... from the Editor menu 


e Click [Complex find] 
e Build the statement: Find CARDIO<400 [Or] CANCER>230 


The moves to the second case in the CARDIO column, the first case 
where CARDIO is less than 400 or CANCER is greater than 230. In 
this instance, the first expression in the statement, CARDIO<400, is 


satisfied. 


ACCIDENT 
CARDIO 
CANCER 
PULMONAR 
PNEU_FLU 


Test expression 
rng{eencemasg 3 


CARDW<400 Or 
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Note that even though the case meets the condition CANCER>230, the 
aa cursor is in the CARDIO column, since CARDIO is the first variable in 
a oe the Find statement. 


= q Editor/ Formats... in the Editor menu lets you control characteristics of the 
a Formats... Data Editor. 





: Z: Decimals to show . 
; 4 O00O102@30405 060708089 

- ® Enter right © Enter down 

; be @ Rectangular © Covariance O Similarity 
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Data Editor 


Decimals to 
show 


Enter right/down 


Matrix type 


Help 


Select... 
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Data Editor utilities 


In the Data Editor, numbers are displayed with three decimal places by 
default. If you want to change this, choose a different number. 


Your choices affect only the current display and not the way the data are 
stored; the next time you open the file, the numbers will have three dec- 
imal places. If you always want to see the same number of decimals, you 
can record your choice with Record preferences from the Goodies 
menu. 


After you enter a data point by pressing Return, the cursor jumps one 
cell to the right. If you want it to jump one cell down instead, choose 
[Enter down]. 


Data files are [Rectangular] by default. If you are entering a matnx (see 
triangular matrices below), choose the appropriate type. If you have 
opened a matrix file generated by SYSTAT, the correct type will be 
shown. Thus, you can use Formats... to find out what type file you 
have. 


You can get help for the Data Editor with Help from the Editor menu. 
This and other help items are discussed in the previous chapter, Help. 


Select... from the Editor menu selects subgroups of cases according to 
the test condition(s) you specify. Subsequent analyses use only the cases 
you select. ‘ 


You can link several test conditions together with the logical [And] and 
[Or] buttons. The Select... dialog box provides a selection list of van- 
ables and another of SYSTAT’s functions for your convenience; you 
may build the Select statement with any combination of selecting, typ- 
ing, and using the [And] and [Or] buttons. 


e Select Select... from the Editor menu 
® Build the Select statement; click [And] or [Or] between expressions 


as necessary 
e Click OK 
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select REGION=1 And ACCIDENT<35 Or CARDIOQ>450 


Select expression 


Select |CARDIO>450 


be a a 
. Sa ie 


REGION=1 And 
ACCIDENT <35 Or 
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Oops! If you try this, you’ll discover that fourteen cases are found, and — 
some of them do not have REGIONS1. This is because SYSTAT reads 
Select statements from left to right, so the implicit grouping is: 


f° 


select (REGION=1 And ACCIDENT<35) Or CARDIO>450 


. a eg i = , 
F L ne, . » oy .. 
oP sia = - ral : ee : rs 
eeSL em: oy weigh a af . ics 
- : = sR ts 


Wich our Select statement, any case that is in region 1 and 
ACCIDENT<35 is selected, and any case in any region with any accident 
rate that has CARDIO>450 is also selected. What we want is actually 


FO PRT WAR IRE RES te TE * NG 


the following: 

, Select REGION=1 And (ACCIDENT<35 Or CARDIO>450) 

Now we get what we want. SYSTAT reports “6 cases were found” and 

fo marks them with bullets in the CASE column of the Editor. We scrolled 

, ¥ right in the Editor window so you could see the ACCIDENT and 

ty CARDIO values. All six cases are in Region 1. 
a , 
MS 4 
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For graphics and analyses only selected cases are used. If we were to use 
Corr now, for example, we would get a correlation matrix based only on 
those six cases. However, if you use Math..., Recode..., or Select... 
the entire data set is used. 


Randonr subsamples 
You can select random subsamples of data with Select... For example, 


Select URN<.5 


selects a random subset of approximately 50% of the cases. 


Extract You can do analyses on selected data either by proceeding from here 

selected... with cases bulleted in the Editor, or by extracting the data to their own 
data file and using that file. It is handy to extract cases to a file if you 
anticipate needing to use them in future SYSTAT sessions. Otherwise, 
the temporary selection is sufficient. 


To extract selected cases and store them in a separate file, use Extract 
selected... in the Editor menu. (You must select cases as shown above 
before you can extract them.) | 


e Select Extract selected... from the Editor menu 
e Specify a filename and folder location 
e Click Save 


You can open the new data file to analyze the extracted cases. 


Remove select Select... stays in effect unl you quit SYSTAT, close the Editor, or 
: - open a new data file. To turn off Select..., select Remove select from 
the Editor menu. 
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Data Editor utilities 


Other ways to 
select data 
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Data Editor 


The Select... item in the Data menu also lets you do case selection for 


analyses. Data/Select... is discussed in the Analysis chapter later in this 
volume. 


You can also do case selection with the scatterplor brushing tools dis- 
cussed in the Exploratory data analysis chapter. 


Here is a quick comparison of the three ways of doing selection: 


Editor/Select... Brushing toois Data/Select... 


Method of selecting cases algebraic visual algebraic 
Logical operators available And, Or — And 
Selected cases marked in Editor in Editor and piot — 
Windows that must be open Editor Editor and View — 


The Editor and brushing methods allow cumulative selection (selecung 
more cases to add to the selection) and all three allow quick de-selec- 


tion. Obviously, the scatterplot brushing tools only work with scatrer- 
plots 


In general, Editor/Select... is more powerful and brushing tools are 
more intuitive than Data/Select... However, Data/Select... is more 
convenient when it is preferable not to have the Data Editor open, since 
having the file available in the Editor can slow SYSTAT down. 
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Special data structures 


I 


The datasets we’ve looked at so far have been fairly conventional: one 
variable for Case (automatically provided by SYSTAT) and individual 
variables recording certain characteristics for each case. There are a few 
different types of data formats you might need: grouping variables, ag- 
gregated data, and triangular matrices. 


Grouping A task familiar to statisticians is determining whether categories of cases 

variables differ from each other. You might want to compare weight losses be- 
tween men and women, or popularity between colas and non-colas, 
prices between different types of stocks, and so on. 


You can do this by using grouping variables. A grouping variable is a nu- 
meric or character variable that does nothing more than label each case 
with its group. 


Grouped datasets let you do comparison and contrast tests: grouped box 
plots, ANOVAs, chi-square tests, summary statistics by groups, category 
plots and bar charts, etc. 


The AFIFI dataset records increases in systolic blood pressure after 
administering one of four different drugs to patients with one of three 
different diseases. Open the AFIFI dataset and take a look at it. 


e Select Open... from the File menu 

e Click [Edit] 

® Choose AFIFI from the Data Files folder 
e Click Open 
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DRUG and DISEASE are both grouping variables. Scroll through the 
dataset and notice how DRUG has only four values, 1, 2, 3, and 4. 
DISEASE has three values, 1, 2, and 3. Each patient in the study is a 
separate row, or case. You can see which disease each one had by look- 


= ing at the value for DISEASE in the row, and similarly which DRUG 
z= the patient was given. 


: ~ Disk:SYSTAT:AFIF I 





: 3 

; 3 : 
; i We see, for example, that the seventh patient (the row where Case is 7) | 
i has the DISEASE 2 and was given DRUG 1. After the drug was admin- 

: istered, Mr. Seven (Ms.Seven) had an increase of 33 in his (her?) systolic 
8 blood pressure. 
4 3 . 
; & We will see this dataset used in the the MGLH chapter of the SYSTAT 7 
7 manual for two-way analysis of variance. 


eo ee re ge 
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Data Editor 


Aggregated 
data 
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Special data structures 


Aggregated datasets group objects with the same measurement values as 
a single case in the file. The CHEST dataset counts the number of 
soldiers with given chest measurements. With the regular data format, 
we would enter a separate case for each soldier, with variables like 
NAMES, CHEST_IN, AGE, etc. This format is great if you want to 
record several characteristics, but we’d spend days typing Mr. Quetelet’s 
5738 cases. This would be silly, especially since we don’t care about any 
other characteristics. A frequency distribution would be more manage- 
able and more useful. 


Here is how the data look in a frequency distribution. The unique values 
in this study are the chest measurements. So, we name a variable 
CHEST_IN, and list all the different measurements down the column. 
Then, we add a count variable, COUNT. 


CHEST_IN COUNT 
33 3 
34 18 
35 81 
36 185 
37 420 
38 749 
39 1073 
40 1079 
41 934 
42 658 - 
43 370 
44 92 
45 20 
46 21 
47 4 
48 1 


As you can see, this is a sensible way to set up the data. Interestingly, we 
can even read the COUNT values as 2 rough histogram: note that the 
numerals themselves (as well as their values) widen in the middle and ta- 


per off at the ends. 


The DANISH dataset combines grouping variables with a COUNT 
variable. This dataset is provided in the Data Files folder. 
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Data Editor 


Both FATHER and SON are grouping variables indicating to which 


economic class the father or son belonged. 


The COUNT variable 


shows how many combinations existed. We can see, for example, that of 
the 2391 father-son pairs included in this study, 18 pairs had both father 
and son from the Ist class. 217 pairs involved class 3 fathers who had 
class 4 sons. As you can see, relatively few sons were able to rise to an 


economic class above that of their fathers. 


We analyze the DANISH dataset in the T 


ables chapter of the SYSTAT 


_ manual to see what patterns exist among fathers’ and sons’ economic 


classes. 
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Special data structures 


Sometimes you may wish to input your data in triangular form. Corr, 
for example, computes correlation, similarity, and dissimilarity matrices. 
A correlation matrix shows the correlation coefficients for each pairing 
of variables: 


ACCIDENT CARDIO CANCER PULMONAR PNEU_FLU 
_ ACCIDENT | 1.000 
CARDIO -0.506 1.600 
CANCER -0.578 0.908 1.900 
PULMONAR -0.099 0.441 0.438 1.000 
PNEU_FLU -0.196 0.538 0.358 0.400 1.060 


This matrix is included in the Corr/Pearson... output for five variables 
fom the MEDICAL dataset. To find out the correlation between 
ACCIDENT and CANCER, for instance, look at the cell in the 
CANCER row and the ACCIDENT column. The correlation is -0.578. 
Note that correlation matrices are symmetric, so only the lower diago- 
nal portion is shown. Since variables always correlate perfectly with 
themselves, the diagonal always has trivial values of 1. 


Suppose you wanted to factor this correlation matrix. You would just 
save a file with Corr, open the file, and factor it with Factor. The upper 
diagonal portion is filled with missing values, so the file would look 
something like this: 


ACCIDENT CARDIO CANCER PULMONAR = PNEU_FLU 
1.000 . 
-0.506 1,000 
-0.578 0.908 1.000 
-0,099 0.441 0.438 1.000 ; 
-0.196 0.538 0.358 0.400 1.000 


Now, suppose you wanted to factor (or do regression or statistics or 
graphs...) a matrix that somebody else published and you don’t have the 
original data. Or perhaps you want to type in a dissimilarity matrix di- 
rectly for Cluster analysis. 


The Editor lets you input matrices directly. As usual, variable names go 
in the top row. Don’t label the cases like the correlation output does; the 
file should look like the matrix directly above. 
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a After you enter the data, you must tell SYSTAT what type of matrix itis 


ae with Formats... from the Editor menu. It is important to do this before 
a you save the file. 
Se. 


oa ¢ Select Formats... from the Editor menu 
ae _ © Choose the type of matrix: [Correlation] 


= As you can see, the default is [Rectangular]. Rectangular files are the 
: usual files of variables and cases. The other five types are triangular 
— matrices that can be produced with Corr. [SSCP] stands for Sums of 


og Squares and Cross Products. (See the Corr chapter in the SYSTAT 
- manual for more information on these matrix types.) 
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Data files 5 


OT 


Overview This chapter discusses data files: how to name them, save them, print 


them, exchange them with other applications, sort them, rank them, 
standardize them, and transpose them. 


| a We discuss saving and printing output (data files or text produced by 
- statistical analyses) in the next chapter, Redirecting output. 
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Saving data in files 


fa Sg ge a 


Filenames SYSTAT filenames are subject only to the usual restrictions imposed by 
Macintosh conventions. You may use upper- or lower-case characters, 
spaces, and some symbols. If you try to specify an illegal filename, you 
will receive an error message. | 


The data files that we provide with SYSTAT in the Data Files folder are 
named in all upper-case characters. Again, this is not necessary. We 
have just chosen to name files this way so you can always recognize file- 
names easily when reading our text. 


Previous versions of SYSTAT imposed some additional limitations on 
filenames; these limitations are no longer in force. You no longer are 
limited to eight upper-case alphanumeric characters: you may now use 
periods, underscores, spaces, and numbers. You no longer need to use 
the SYS, .CMD, and .DAT filename extensions (suffixes). If you have 
files created with previous versions, the old filenames will work fine with 

this version. You do not need to rename any old files unless you want to 
do so. 


Note that if you choose to use the optional command interface, you 
must surround folders and.filenames with single or double quotation 
marks, 


Also note that SYGRAPH map files must still have a .MAP suffix. We 
discuss .MAP files in more detail in an appendix to the SYGRAPH vol- 
ume. 


Saving data You can save data with two items of the File menu. Save saves your data 
(or your changes) to the current file, overwriung the previously-stored 
data. Save as... provides a filename which you can change. Save as... 
also lets you save your data in different formats suitable for exporting to 
another application. 


Save Save from the File menu saves your data in the current file and locauon 
(folder), writing over any previous data that may have been stored under 
that filename. | 
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Saving data in files Data files 


If you started with a New worksheet, Save gives you the Save as... dia- 


log box (discussed next) so you can specify a filename and folder loca- 
tion. | 


e Select Save from the File menu 


: © Specify a filename and location 
| e Click Save 


Save as... Save as... from the File menu lets you specify a filename and folder lo- 


cation. You may also save your data files in one of two plain text (ASCID 
formats, [Text] and [Tabs.} 


e Select Save as... from the File menu 

e Specify a filename and folder location 
a © Choose a text format for exporting 

. 4 @ Click Save 
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Save file as 


@®SYSTAT OText OtTabs ODLOG ; 
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: SYSTAT format 
— The default file format is a SYSTAT file. If you are going to do statsu- 


cal or graphical analyses on the data you save, they must be in SYSTAT 
format. 


a , 
a Plain text formats ; 
, You can save SYSTAT data files in one of two plain text formats. [Text] 
' § saves your data in comma-delimited form, where successive values in 2 : 
Tow are separated by commas and lines are separated with Carriage re- 
turns. [Tabs] saves in tab-delimited form, where successive values in a : 


a: Tow are separated by tabs into columns; lines are separated with carrlage 
oe | returms. 
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Single and 
double precision 


96 


Saving data in files 


You may read plain text data files into other applications such as 
Microsoft Excel. Then, you can save results from that application in text 
form and read them back into SYSTAT with Open... from the File 


menu. 


SYSTAT saves data files from the editor in double precision by default. 
Double precision allots 8 bytes per value, which retains up to 15 signifi- 
cant digits in your data. 


You can also save in single precision, which allots 4 bytes per value and 
ensures approximately 7 significant digits. Saving in single precision 
takes up about half the disk space as double. SYSTAT does all its calcu- 
lations in double precision, so it converts single precision data to double 
before processing. 


All character values are stored in 12 byte strings regardless of whether 
you save in single or double precision. 


Again, SYSTAT saves data from the editor in double precision by de- 
fault. To save data in single precision, you must use the command inter- 
face. Select Show command window from the Goodies menu. Then, 
type at the prompt, pressing Return at the end of the command. 


YWRITE filename / SINGLE 


In the command above, filename represents the filename you give to the 
file. If you want to save in a different folder or specify a filename that 
includes blank spaces, you must surround the complete folder locanon 
and filename with single or double quotation marks: 


SWRITE ‘Disk:SYSTAT:My Data:myfile’ / SINGLE 


For more information about the optional command interface, see the 
Command reference chapter at the back of each manual. 


When you save data produced by statistical procedures, a dialog box asks 
whether to save in single or double precision 
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Saving data in files Data files 


Problems? 


Here are some guidelines for avoiding problems. 


1) Be sure you have enough space on your hard disk. As a general rule, 
you should try to make sure that you have about four times more free 
disk space than your largest data file. (You can see how much space you 
have by looking at the window of your topmost “folder”—the one you 
see by double-clicking the hard disk icon. Select By icon from the View 
menu befored starting SYSTAT. The top right corner of the window 
shows how much hard disk space is free.) 


2) Save frequently whenever you are working with data files. That way, 
if the power fails or you spill tea on your machine, you will only lose a 
small amount of work. 


3) Because SYSTAT operates both on commands and menus, itis a 
good idea not to include apostrophes in folder or filenames. SYSTAT, 
like most other computer languages, uses apostrophes as character de- 
limiters in commands. While it is trivial to process apostrophes through 
menus, there are conflicts you can encounter when they appear in com- 
mands. If you have problems, such as an “empty file” error, just remove 


the apostrophe and try again. . 
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Printing data 


You can use Print data... from the Editor menu to print your data. 
Page Setup in the File menu lets you set up how things are to be 
printed from SYSTAT. 


Page Setup... Page Setup... lets you choose options for printing. Each type of print- 
ing device has its own dialog box, so what you see in the dialog box de- 
pends on what kind of printing device (LaserWniter, ImageWriter, etc.) 
you have hooked up and what version of the printer drivers you are 
using. 


Print data... Print data... prints the data in the file currently in the Data Editor 
window. 


e Select Print data... from the Editor menu 


Other ways to You can use text files or the Clipboard and Cut, Copy, and Paste to 
‘print data move all or part of a worksheet into a text ediung application (such as 
WordPerfect or Microsoft Word) where you can format the data to 
your needs before printing. Instructions for exporting data to other 
applications through text files or the Clipboard appear in “Exportng” 
section of this chapter: 


| You can use the DATA facilities to print all or part of a dataset. See the 
first several chapters of the DATA volume for instrucuons. 


Problems? If you have problems printing data or output from SYSTAT, be sure 

| that you have current versions of the appropriate drivers (for a 

7 LaserWriter, for example, you need both LaserWniter and LaserPrep) 
present in your System Folder. Often problems occur because these files 
are either missing or damaged, and you can fix things by merely recopy- 
ing the files from the master disk, restarting your Macintosh, and trying 


Hi again. 


98 © 1989, SYSTAT, Inc. 


-~- 





oa me Sees 


at Sy yee mya. ore ; tes ead eer ae 
wnat gl a ee tos Ia Th wi LY ELE ye 
: - : 5 -, ar ee = 
rt wank CLIT PF, Ae re tek a Sgt 
Ph -¥ 2 te Ie + is fle tag : me 


aad ghee we ts i. * 
« ar ime 


Importing data 
meee 


You can import data from other applications three different ways. 


1) You can Cut or Copy data in the other application, switch to 
SYSTAT, and Paste the data into the SYSTAT Data Editor. 


2) You can save data from one application in a text file, switch to 
SYSTAT, and read the text file into the Data Editor. 





3) You can use SYSTAT’s Import... facility to read Excel data files or 
to do fast importing of ASCII text files. 


The Clipboard technique is a quick way to import small amounts of 
data. For larger files, we recommend using external text files. Also, you 


a need to use text files when importing data from other platforms, such as 
4 from a UNIX machine. 

fee: Pasting data You can transfer data from another application to SYSTAT through the 

ame from the Clipboard. 


wae Clipboard 


You can paste into a new worksheet or an existing data file. 


e Open the other application * 

e Select the range of information 

° Select Cut or Copy from the Edit menu 
¢ Quit from the application 


e Start SYSTAT 

e Select New from the File menu and labe! the variables, or Open... 

i, 2 an existing SYSTAT file 

¢ Click the upper right corner cell of the range where you want the 
data to appear 

e Select Paste from the Edit menu 

¢ Save your new data 
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Delimiters 
The pasted data must be delimited by commas, spaces, or tabs. 


ma 
a) 
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Data files Importing data 


Variable names 
If you do not transfer variable names with your data, you must label the 
variable columns before pasting the data. If you don’t, SYSTAT names 
the columns COL01, COL02, ..., COLz, and of course you can edit 
those names after you paste the data. 


Character values 
Any character values that contain spaces or single or double quotation 
marks must be surrounded with (the opposite type of) quotation marks. 


Reading a text You can read text files saved from other applications with Import... 
file from the File menu. 


You must save the data from the other application as plain text. Most 
applications allow you to choose the file format in which to save with 
their Save As... dialog box. Choose [Text only] or a similar plain ASCil 
format. 


¢ Open the other application 
e Save the data file in a text only format 
e Quit from the application 


e Select Import... from the File menu 
@ Choose [ASCII] 

e Select the file 

e Click Open 
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Importing data Data files 


ii Data Files 

1 meminfo 

© Subroutines 
De DO Systat Help 





a Note that after you click [Text], only text-format files are included in 
. 3 the selection list. 
~~ The file is translated and saved directly in a disk file. If you want to use 
a the file right away, use File/Open.... 
| 4 Delimiters 
3 The text files must separate values (columns) with tabs, commas, or 
-_ spaces. These characters, which tell SYSTAT where one value ends and 
4 the next begins, are called delimiters. 
| Delimiters between values must be only a single character. Consecutive 
7 : values cannot be separated by, say, both a comma and a tab. Use one or 
: t q the other. See “Missing values” below. 
; : Rows (cases) must be separated by carriage returns (paragraph marks), : 
= Most Macintosh applications can produce text files with these delim- 
bz iters. [he last row (case) of the data file must end with a Return; if it 
4 does not, SYSTAT will not read the last row of data. 
: . : Variable names : 
Bi 8 The file you open may include variable names on the first record (line) 
fy 4 of the file. 
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Data files Importing cata 


If the file does not have vaniable names, SYSTAT names the columns 
COL01, COL02, ..., COL. You may name the variables before read- 
ing the file into SYSTAT, or change SYSTAT’s generic variable names 
after reading the file. 


Character values 
Any character values that contain spaces or single or double quotation 
marks must be surrounded with (the opposite type of) quotation marks. 
Otherwise, character values do not need quotation marks, but you can 
use them if you want. 


If your application does not produce text files that are precisely compat- 
ible with SYSTAT’s requirements, you can use a word processor to fix 
them up. Remember to save in a plain text format from the word pro- 
cessor! 


Missing values 
If SYSTAT encounters two consecutive delimiters in a case, it assumes 
that a missing value appears in that position of the data file. SYSTAT 
then inserts a period (for numeric data) or a blank space (for character 


data). : 


You can also code missing values before imporung the text file—again, 
use periods for missing numeric data and blank spaces surrounded by 
quotation marks for missing character data. 


If missing values are coded some other way (for instance, an application 
might store missing values as ~99), correct them before you attempt to 
do analyses on the data. 


—s Excel SYSTAT can automatically import data files from Microsoft Excel. 
wes | 

e Select Import... from the File menu 

e Choose [Excel] 

e Select the file 

@ Click Open 
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+ Problems? 


q 4 Reading map 
=. files 
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Data files 


Import... can read integer, number, character, label, and blank data 
types. Integer and number fields are converted to numeric variables for 
SYSTAT, blanks are read as missing values, and labels are converted to 
SYSTAT character variables. If any character values or variable names 
are longer than SYSTAT allows, they are truncated. If truncation results 
in duplication, an error results. 


Import... cannot read encrypted or password-protected files. Unprotect 
them first. 


Boolean and formula variables are not read. 


Some of my variables weren't read 
Does the file contain variable names? If so, make sure that all the vari- 
ables have labels and all the labels are in the first record (line, case, row) 


of the file. 


My data got all messed up 
Be sure that the text file doesn’t have two delimiters adjacent to each 


other except to skip over missing values. Remember, only one delimiter 
can appear between each value. If SYSTAT encounters consecutive de- 
limiters, it records a missing value; thus, the next value will end up one 
position “later” and so on. Use a text editor to fix the file before import- 


ing. 


To create a map file, first create an ASCII file set up according to the 
description given in Appendix V of the SYGRAPH volume. Then, con- 
vert that ASCII file to a MAP with Import... Note that the MAP file 
must have a partner data file of the same name without the .MAP ex- 


tension. : 


e Select Import... from the File menu 
e Click [Map] 

e Select your ASCII file of coordinates 
Click Import 
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Exporting data 





You can export data in two different ways: 


1) You can Cut or Copy data into the Clipboard from SYSTAT, switch 
applications, and Paste the data into the other application. 


2) You can save data from SYSTAT in a comma-delimited [Text] for- 
mat or tab-delimited [Tabs] format with Save as..., and read the text 
file into the other application. 


As for importing, it is quicker and easier to use the Clipboard to export 
relatively small amounts of data and to use external text files for larger 
datasets. Also, you need to use text files when exporting to a different 


platform, such as to 2 VAX. 
Copying or You can transfer data from SYSTAT to another application through the 
Cutting into the Clipboard. 


Clipboard 
e Open the file in the Data Editor 
e Select the range of data 
e Select Cut or Copy from the Edit menu 
© Quit from SYSTAT 


e Open the other application 
e Place the cursor where you want the data to appear 
e Select Paste from the Edit menu 


Saving text files You can save SYSTAT data into plain text files with Save as... 


e Select Save as... from the File menu 

© Specify a filename and folder location 

e Choose [Text] for comma delimiters or [Tabs] for tab delimiters 
@ Click Save 

e Select Quit from the File menu 


e Open the other application 
e Read the text file according to that application’s instructions 
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Exporting Oata Data tiles 


Delimiters 
Generally, [Tabs] is preferable for transferring data to another 
Macintosh application, unless that application instructs others. 


Generally, [Text] (comma delimiters) is preferable for transferring data 
to an IBM-PC or mainframe application. 
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Manipulating data files 


SYSTAT lets you sort, rank, standardize, and transpose your data files 


with items in the Data menu. To use these items, you must have a data 
file open in SYSTAT Data Editor. 


Sort... Sort... orders all the cases in the file in ascending order according to up 
to ten numenic or character variables. The results are stored in a differ- 
ent file. You cannot change the current worksheet with Sort... 


e Select Sort... from the Data menu 
e Select the sorting variables, in order 
e Click OK 


ee ee eee 


Select sort variables 


STATE$ 
REGIONS 
REGION 
DIDISION$ = - 


MUL = 





After you click OK, a Save as... dialog box appears so you can specify a 
filename and folder location for the output sorted file. 


If you do not select any variables, SYSTAT sorts using the first variable 
in the file. 
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Manipulating data files Data files 


Nested sorting 
If you select more than one sorting variable, SYSTAT does a nested 
sort. For example, if you select SEX$ and AGE, in that order, SYSTAT 
first sorts the males and females into two groups, and then sorts each 
group from youngest to oldest, so the SEX$ and AGE columns of your 
output file might look like this: 


SEXS AGE 
female 
female 
fernale 
female 

maie 
male 
male 
male 


ne oe 


—— 
ONAMDOO MOM 


Numeric order 
SYSTAT sorts numeric data by putting missing values at the beginning 
of the file and the largest values at the end. 


vi we 
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Character order 
ee, SYSTAT sorts character data in ascending ASCII order, with blanks 
<.. (missing values) at the beginning. The sequence for an ascending char- 
; acter sort is: 
cy 1" #S% &'()*4,-. 
3 0123456789 
ese 13<=>7@ 
ow | ABCDEFGHIJKLM 
. 4 NOPQRSTUVWXYZ 
_ : ET 
Pg abcdefghijklm 
4 nopqrstuvwxyz 
a This means that words are sorted alphabetically with upper case words 
a preceding lower case. 
Bq 
if 
a ‘ ‘4 : 
# = © 1989, SYSTAT, Inc. 107 
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Rank... 
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Manipulating data files 


Rank... replaces values with their ranks in the variable you select. The 
results are stored in a different file. You cannot change the current 
worksheet with Rank... 


e Select Rank... from the Data menu 
e Select the rank variables 
e Click OK 


After you click OK, a Save as... dialog box appears so you can specify a 
filename and folder location for the output ranked file. 


If you do not select any variables, SYSTAT ranks all the numeric vari- 
ables in the file. 


“Rank” is the case number a value would have if the dataset were sorted 
on that variable. For example, the smallest value would be rank 1, and 
the largest value in a fifteen-case variable would be 15. Ties are aver- 
aged. Here is a simple example that shows how ties are handled: 


Value Rank 


| O WON — 
ee) 
cn 


1 
1 


in, Rank... replaces the values of the variable with their ranks, so if 
you need to have both the ranks and the original values in one file, make 
a copy of the variable before ranking. Rank... changes the values only of 
the variable(s) you select, and all other variables are copied to the new | 
file unchanged. You can use Math... to copy a variable, e.g. “Set AGE 
to AGE2.” 


Large files 


Rank... does its work in memory. Therefore, you can run out of room 
with files containing thousands of cases. If this happens, first Sort... the 
file on the variable you want to rank. Then, use Math... to replace the 
values of the variable with CASE numbers. This method wont handle 


tes however. 
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Manipulating data files Data files 


Standardize... 


Transpose... 
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Standardize... replaces values of the variables you select with their 
sample standard scores, or z-scores. The results are stored in a different 
file. You cannot change the current worksheet with Standardize... 


e Select Standardize... from the Data meny 
e Select the variables to be standardized 
e Click OK 


After you click OK, a Save as... dialog box appears so you can specify a 
filename and folder location for the output standardized file. 


If you do not select any variables, the first numeric variable in the file is 
standardized. 


Standardize... subtracts the sample mean from each value and then di- 
vides the difference by the sample standard deviation. The standardized 
values have mean 0 and standard deviation 1. 


Transpose... transposes the cases and variables of a file. That is, the 
variable columns become case rows and vice versa. The results are 
stored in a different file. You cannot change the current worksheet with 
Transpose... 


Two restrictions apply. First, your file cannot contain character vari- 
ables, except the first variable in the file may be a character variable 
containing names for variables in the transposed file. That variable must 
be called LABELS. Second, you cannot transpose a file with more than 
99 cases. 


e Select Transpose... from the Data menu 
e Specify an output filename and location 
@ Click Save 


It is unnecessary to transpose a symmetric matrix (e.g., any matrix pro- 
duced by Corr), because the transposed matrix is the same. 


The transposed file contains one character variable called LABELS that 
contains the old variable names. The new columns (variables) are la- 
beled COL01, COLO2, ..., COL, where n is the number of cases in the 
old file. 
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Saving output in files 112 
Graphs 112 
Save graph as... 112 
Viewing PICT files | 112 
Text 113 
Notes... 113 
Save selected text as... 114 
Results to file as... 114 
Data produced by analyses ltS 
Using saved data 115 
Printing output 116 
Page preview... 116 
Graphs 116 
Print Graph... 116 
Results to printer . 17 
Text 117 
Page formats... 117 
Print selected text... 118 
Results to printer 118 
Data produced by analyses 119 
Placing output in the Clipboard 120 
Graphs 120 
Text ; 120 
Data produced by analyses 120 
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Overview 
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SYSTAT lets you print and save analysis output, graphs, and data pro- 
duced by analyses. You can print and save things one piece at a time, or 
you can tell SYSTAT to print or save everything. You can also place all 
three types of information in the Clipboard for transfer to other appli- 
cations. (The previous chapter, Data files, discussed how to save, print, 
and place in the Clipboard SYSTAT data files.) 


The easiest way to figure out what you’re trying to print or save is by 
knowing what type of output appears in which window. 


The View window is where SYGRAPH draws graphs. Some SYSTAT 
routines (in the Stats menu) also draw graphs in the View window— 
Factor’s factor loadings plots, for example. The View window shows 
graphic output. 


The Analysis window is the window that shows text results—the results 
of operations selected from the Stats menu. When SYSTAT spits out a 
correlation matrix, for example, you are seeing text output. 


SYSTAT and SYGRAPH have an optional command interface, which is 
discussed in Appendix I of the SYGRAPH and SYSTAT volumes. Any 
commands or messages that appear in the Command window are text. 


Finally, you can save data produced by analyses. These are results that 
do not necessarily appear in any window, though some do appear in the 
Analysis window. You can save such things as means, minima, and 
maxima computed by SYSTAT. Or, you can save residuals from regres- 
sion models in a data file. You can save a correlation matrix in a matrix- 
type data file. Generally, you save data from analyses by clicking the 
[Save file] option in the dialog box for that procedure. To learn more 
about saving data from a particular procedure, see the corresponding 
chapter in the SYSTAT manual. 
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Saving output in files 


Graphs 


Save graph as... 


Viewing PICT 
files 
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Graphic output is anything that appears in the View window. You can 
save any graph in the View window into a PICT file. You can open 
PICT files with other Macintosh graphics applications such as 
SuperPaint. You can also bring graphs saved from a previous SYSTAT 
session back into the View window. 


Note that you can transfer graphics to other applications two ways: el- 
ther save a PICT file with SYSTAT and open that file with the other 
application, or copy a graph in to the Clipboard and paste it into the 
other application. See the “Placing output in the Clipboard” section for 
more details on the latter. 


Note that the ACF, PACF, and CCF plots produced by Series items 
appear in the Analysis window and are considered to be text. 


You can save a graph currently displayed in the View window with Save 
graph as... from the File menu. If you have stacked several graphs in 
the View window, scroll to the graph you want to save and then select 
Save graph as... You can only save one graph at a time. 


e Display a graph in the View window 

e Select Save graph as... from the File menu 
e Specify a filename and folder location 

e Click Save 


Graphs are saved in PICT files. You can view a PICT file in the View 
window by double-clicking its icon, which launches SYSTAT and places 
the graph in the View window. You can then use the graph as usual—al- 
ter it with Plot tools and colors..., print it, etc. 


You can also open PICT files with Open... from the File menu. 


e Start SYSTAT 

e Select Open... from the File menu 
e Click [PICT] 

e Select the PICT file you want 

e Click Open 
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Saving output in files Redirecting output 


To use PICT files with another Macintosh application, see the 
documentation provided with that application. 


Text Text is anything that appears in the Analysis window or the Command 
window. 


Both the Analysis window and the Command window are text editors. 
You can type in them and use Cut, Copy, and Paste in them. You can 
select all or part of a window’s content for prinung or saving. You can 
type a batch of commands in the Analysis window, Copy them into the 
Clipboard, and Submit Clipboard for processing. 


4 Thus, you can use the Analysis window to compile quick reports: just 
type your introduction, comments and interpretations right into the 

= output, select all the text, and save it or print it. You can change the font 
Pa of the text with Data/Formats... 
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Tt 
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Notes... You can also use Notes... to place up to five lines of comments in the 
Analysis window. The text you type in the lines with [#1]-[#5] is cen- 
tered at the top of the Analysis window. Using Notes... is equivalent to 

—_ using the mouse to position the cursor in the window and typing the 

notes. 





e Select Notes... from the Data menu 
= ® Type your notes in the boxes 
a @ Click OK : 


Note lines 


. eo ee . 
ae TS 
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Save selected text as... saves the selected text in a file. First, highlight 
the text you want by using the mouse or Select all from the Edit menu. 
(Be sure that the window that shows the text is the active window by 
clicking on it.) 


e Select the text you want to save 

e Select Save selected text as... from the File menu 
e Specify a filename and folder location 

e Click Save 


Note that the file type options are dimmed; this is because you can only 
save text in [Text] files. 


If you name an already existing file, its contents are replaced with the 
new text. Be careful not to destroy previous work. 


Results to file as... saves all subsequent text output sent to the Analysis 
window. Output continues to appear in the Analysis window, and an- 
other copy goes to the file you specify. 


Note that Results to file as... saves subsequent output: if you need to 
save some results already in the Analysis window, you'll have to produce 
them again after selecting Results to file as... (or use Save selected 
text as... to save them by hand to a different file). 


e Select Results to file as... from the Data menu 


e Specify a filename and folder location 
@ Click Save 


To turn off Results to file as..., select Results to window. Otherwise, 
SYSTAT will save output until you quit. 
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Saving output in files 


Data produced 
by analyses 


Using saved data 
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Redirecting output 


You can save data produced by most Stats procedures in SYSTAT files 
by checking the [Save file] box in the dialog box. In some dialog boxes, 
the option has a different name, like [Save residuals]. 


e Select the procedure you want from the Stats menu 
e Select the variables and options for the analysis 

¢ Check the [Save file] (or similar option) box: 

@ Click OK 


Before SYSTAT starts its computations, it gives you the Save as... dia- 
log box. 


© Specify a filename and folder location 
@ Ciick Save 


Then, you sometimes get another dialog box where you choose [Single] 
or [Double] precision and exactly what type of information to save. 
Make your choices and click OK. 


To examine the saved data, close the current data file (saving first if nec- 
essary) and open the file. You can click [Edit] if you want to see the data 
displayed in the Data Editor. Otherwise the file is just opened for fur- 
ther analysis or graphing. 
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Printing output 


a 


Page preview... Page preview... from the File menu lets you see what you're about to 
print. After you select text or display a graph, select Page preview... 
from the File menu. You can move text and graphs, and you can also 
resize graphs using the gray resize box in the lower right corner of the 
graph marquee. The moving and resizing techniques are the same as for 
the ruler option, discussed in the Graphing chapter. 


Ca 


y+ Pees 





SEs! vase 


When you are ready to print, click [Print]. To cancel, click [Exit]. 









Graphs You can print graphs two ways: one at a time with Print Graph... from 
the File menu, or all subsequent graphs and text with Results to 
printer from the Data menu. 


Print Graph... Print Graph... sends the graph currently displayed in the View window 
to the printing device. 
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Printing output Redirecting output 


As with Save Graph as..., the View window must be active. Only the 
current graph is printed. If you want to print all the images stacked in 
the View window, you must print each one separately. 


e Display a graph in the View window 
® Make sure the View window is active 
e Select Print Graph... from the File menu 


Results to printer © Results to printer sends all subsequent graph and text output to the 
printing device. All output is still sent to the Analysis and View win- 


dows. 
= e Select Results to printer from the Data menu 
- a : 
7 Now do your analyses and graphs as usual. To turn printng off, select 
= Results to window from the Data menu. 
pi 


Text You can print text two ways: one piece at a time with Print selected 
text... from the File menu, or all subsequent output (text and graphs) 
with Results to printer from the Data menu. 





F: = _ Page formats... Page formats... lets you specify how output is printed. First, you can 
: enter up to five lines of title text to be displayed at the top of each page 
for printed text output. 


You can also choose from two types of output: [Normal] or [Large 
screen]. These options control how long each line of text is. For in- 
stance, correlation matrices are usually too big to display in one piece, 
Sa | so some columns “wrap” to a subsequent set of lines. If you have a large 
bsg screen (e.g., a standard Mac II screen), you can click [Large screen] so 

E that more columns of text can be shown per line. Your choice of 
[Normal] or [Large screen] also affects the way output is printed and 
saved. 
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Redirecting output Printing output 


Title lines 


® Normal © Large screen 


Cancel 





If you want more than five lines of title text, use the command interface, 
discussed in Appendix I of the SYSTAT and SYGRAPH manuals. The 
PAGE command allows up to ten lines for its TITLE opton. 


Print selected - Print selected text... prints one piece of selected text. First, highlight 
text... the text you want by using the mouse or Select all from the Edit menu. 
(Be sure that the window that shows the text is active.) 


e Select what you want to print 
e Select Print selected text... from the File menu 


Results to printer © Results to printer sends all subsequent text and graphic output to the 
| printing device as well as the Analysis and View windows. 


e Select Results to printer from the Data menu 


Now do your analyses and graphs as usual. To turn printng off, select 
Results to window from the Data menu. 
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Printing output Redirecting output 


Data produced — To print data saved from analysis procedures, first open the file in the 
by analyses Editor. Then, select Print data... from the Editor menu. 


e Select Open... from the File menu 
e Click [Edit] 
e Select the file 


e Click Open 
e Select Print data... from the Editor menu 
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You can use Copy Graph from the Edit menu to copy the current 
graph into the Clipboard. You may then paste it into other applications. 


Note that the View window must be active, or else Copy Graph does 
not appear in the Edit menu. Copy Graph only copies the graph cur- 
rently displayed. You must copy and transfer other graphs stacked in the 


View window one at a time. 


To move text from the Analysis or Command windows into the 
Clipboard, use Copy or Cut from the Edit menu. First highlight the 
text you want by using the mouse or Select all from the Edit menu. (Be 
sure that the window is active.) 


To move data saved from Stats procedures into the Clipboard, first 
display the file in the SYSTAT Data Editor. 


e Select Open... from the File menu 
e Click [Edit} 

e Select the file 

e Click Open 


Next, highlight the data you’ want (all or any rectangular range of cells 


in the Editor) by using the mouse or Select all from the Edit menu. 
Finally, use Cut or Copy to place the data in the Clipboard. 
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Placing output in the Clipboard 
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Overview The first section of this chapter, “Setting up,” discusses some ways you 
can adjust SYSTAT according to your preferences. 


The second section, “Using dialog boxes,” describes how the dialog 
boxes work and ways you can use them more efficiently. You can, 
among other things, save all of your dialog box settings from a particular 
session and restore them the next ume you need to work on those 
data—so you hit the ground running in your next session. You can also 


- set up “bundles” of variables so you only have to look at the variables 
that interest you, yet you can access the other ones whenever you need 
them. 


The third section, “Analyzing subgroups” shows how to analyze data by 
groups through the use of grouping variables and how to analyze only 
variables that meet a given test condition (for instance, all the New 
England states but no states from other regions). 


The final section, “Weighung datasets,” shows how to weight cases in a 
dataset by a weight variable. 
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Here we discuss some options that can help make SYSTAT and 
SYGRAPH more efficient. 


Formats... lets you control how SYSTAT displays text output. Show 
options in effect shows all the settings you’ve chosen for Formats... 
and Page formats... Record preferences stores your settings so that 
they will automatically be chosen whenever you start SYSTAT. 


Formats... controls the way SYSTAT displays text output. 
e Select Formats... from the Data menu 


e Make your choices 
® Click OK 


Decimals to show for {Analysis results [] Graphics 


000102 @©3040506070809 


@ Standard © Extended results 


Text font Size 
[] Menu click & Plain 


Bold alte 


C undertine | NB Helvets 


[Outline | NB! Helvel_] 





C] Shadow New Centiy, 
{ ] Condense 
[] Extend 





All Formats... settings stay in effect for the duraton of your SYSTAT 
session unless you change them again. If you use Record preferences 
the settings are also chosen at the beginning of every subsequent 
SYSTAT session. 
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Setting up Analysis 


Warning: in the SYSTAT volume, we occasionally ask you to change a 
Formats... setting for an example. It is understood that unless we state 
otherwise, all the default Formats... are used. We assume that if you try 
an example, you will restore the defaults as necessary. We usually do not 
remind you to change Formats... back. If your output looks different from 
that shown in the manual, check your Formats... settings. 


[Decimals to show] 
You can control the number of decimal places that are shown in analysis 
output. The default setting, 3, means that every number shown in anal- 
ysis results is printed with three digits past the decimal. You can show 
from zero to nine places, and you can control [Analysis results] (text 
output) and [Graphics] (View window plots) separately: 


e Select Formats... from the Data menu 
¢ Choose the number of decimal places for [Analysis results] 
e Click OK 7 


Select Formats... again 

Check the [Graphics] box 

2 Choose the number of decimal places for graphics 
- Click OK 





La [Standard] or [Extended results] 

ee For some SYSTAT procedures, you have a choice of [Standard] or 

: {Extended results]. [Standard] results are the default and contain only 
ae the most common statistics. [Extended results] show some additional 
output for some procedures. 


Your choice stays in effect until you change it or until you quit 
SYSTAT. If you only want extended results for one regression model, 
for example, switch to extended results before you estimate the model, 
esumate the model, and then switch back to standard results. 


[Menu click] 

If you check the [Menu click] box, you hear sound effects when you use 
a. SYSTAT’s menus. These sounds might make it easier to know when 
oe you are moving through menus. If they merely annoy you, leave the op- 

a tion unchecked. 
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Analysis Setting up 


[Scroll analyses] 
Ordinarily, SYSTAT clears the Analysis window before it displays more 
results. If you prefer that all results accumulate, click [Scroll analyses]. 
New results are appended to the old, and the window scrolls. 


One possible problem with scrolling analyses is that you may run into 
memory problems. If you do, just delete any text that you don't need 
any more. Close the Analysis window to clear it completely. 


[Text font] and [Size] 
SYSTAT displays text results in boldface 9 point Monaco by default. 
You may choose any font and size available on your machine; the selec- 
tion will vary according to which fonts you have installed in your System 
file or in Suitcases. (See your Macintosh System Software User’s Guide.) 
You may also choose from the various sizes. 


Warning: you should generally choose a monospace (non-proportional 
width) font such as Monaco or Courier, because proportional width 
fonts (such as Helvetica and Times), where some characters are wider 
than others, destroy the neat columns of SYSTAT output. If you prefer 
to use a proportional font for a report, you will need to replace blank 
spaces with tabs with your text processor. 


Show options Show options in effect from the Goodies menu tells what file is in use 
in effect and what settings have bee chosen. 





SYSTAT options now in effect 


File in use 15 
Disk: SYSTAT:Data Files: USDATA 











SYSTAT output display options are 
SYSTAT prints standard results 
in narrow page format, scrolls screen output, and displays 


Numeric output values print to 3 decimal places 
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Setting up 


Record 
preferen ces 





Analysis 


Record preferences from the Goodies menu records all the setnngs 
you have chosen in Formats... and Page formats... from the Data 
menu and Formats... from the Editor menu. It also records any 
changes you make to the size or locations of windows. Also, any win- 
dows that are open when you use Record preferences will be opened 


every ume you start SYSTAT—even if the window is ordinarily not 
shown. 


Your preferences are stored in a file called SYSTAT Pref in your 
SYSTAT folder. If you want to restore the defaults, simply drag 
SYSTAT Pref out of your SYSTAT folder. 
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Using dialog boxes 


a 


SYSTAT dialog boxes work pretty much the way you expect. However, 
they have some convenient new features. 


Selecting To select a variable or function from a selection list, you may either 

variables and double-click the variable or function, or click it (to highlight it) and then 

functions click the [Select] button beneath the selection list. If you use the latter 
method, be sure to use the [Select] button underneath the correct selec- 
tion list. 


Some dialog boxes have two variable selection lists. Usually the first 
(leftmost) list is the main selection list, and the second list is for select- 
ing grouping variables. 


Functions When you select functions or operators, be sure that the argument of 
the function goes inside the parentheses. For instance, if you want 
ABS(RESPONSE), select ABS. Notice that the cursor is situng inside 
the parentheses. If you now select RESPONSE, it will be placed cor- 
rectly inside the ABS parentheses. You must be careful, however, when 
constructing more complex expressions. If you must, move the cursor 
with the mouse and type all or part of the expression from the keyboard. 

Keyboard If you don’t like using a mouse, you can use a number of keyboard 

shortcuts | shortcuts. 


You can always type in the boxes where your selections are placed. You 
can edit selections as needed, type names of variables that aren’t in the 
current bundle (see below), and type any symbols (such as + or -) that 
aren’t available from selection lists. 


Return for OK 
You can press Return to set a procedure in motion; this 1s an alternative 


to clicking OK. 


Moving from box to box 
Some dialog boxes, like the Recode... box, have several boxes where 
expressions are entered. You can move between boxes with the mouse or 
by pressing the Tab key. 
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Using dialog boxes 


Options 











Analysis 


Most of the dialog boxes contain optons. There are three types of op- 
tions. 


An exclusive option is one that prevents you from picking others; if you 
choose another, the first one turns off. Exclusive optons have radio 
buttons. For example, many dialog boxes ask you to choose [Listwise] or 
{Pairwise} deletion of missing values. The radio buttons tell you that 
you can only pick one or the other. SYSTAT will not let you pick both. 


@ Listwise 
© Pairwise 


A nonexclusive option is one that can be chosen in combination with 
others. Nonexclusive options have check boxes. You can click any of the 
optons to put an X in the box (to “check” the box), and you can click a 
“checked” box to “uncheck” it or turn it off 


[J] Cv &) sb 
CjKurtosis [SEM 
kJ Meximum []Skewness 


i] Mean _] Sum 
KJ Minimum f& Variance 
i) N ; 


() Range CJ} AN 


Nonexclusive options sometimes appear alone (like [Save file]), and you 
Just decide whether they should be on or off. 


A third type of option takes an argument. For instance, if you want con- 
fidence intervals for Tables/Tabulate..., you must specify the confi- 
dence level. To turn on the (Confi} option, type a value in the box. 


Conti = [95 | 


If you ever decide that you don’t want to perform an analysis after all, 
Just click Cancel instead of OK. SYSTAT closes the dialog box and lets 
you select some other procedure. 


If you make mistakes selecting variables, you can either edit the box by 
hand or click Clear to erase all your selections and start over again. 


Analysis Using dialog boxes 


After you click OK to execute an analysis, SYSTAT “remembers” the 
choices you made in the dialog box: the variables you selected, the op- 
tions you checked, etc. The next ume you go to that dialog box, you see 
those same selections. You can click Clear to clear out the selection. To 
undo options, you must click them off. 


If you use a dialog box to do one problem and then return to it to do a 
completely different problem, you will need to click [Clear] to remove 
earlier selections and look over the options to be sure you have what you 


want. 
Saving dialog If you frequently do similar analyses on the same dataset in separate 
box settings SYSTAT sessions, you may save ume by restoring a previous day’s dia- 


log box settings. All the variable-selecting and opuon-clicking that you 
did before is already there, and you can save ume if you need to do al- 

most the same problems—you can just uncheck an option or select an- 
other variable and go. 


e Select Save as... from the File menu 
e Click [DLOG] 

e Specify a filename and folder location 
e Click Save 


The next ume you want to use those dialog setungs, open that file: 


e Start up SYSTAT 
e Select Open... from the File menu 
e Click [DLOG] 

e Select the file you saved 

e Click Open 


You can save as many files of setungs as you want, but you can only 
open one at a ume. If you open a second DLOG fle, the first file’s set- 
tings are replaced in all dialogs by the second file,s settings. 


Variable If you have a dataset with many variables, it may not be convenient to 
bundles scroll through all the variable names before you come to the ones you 
want to select. 


130 © 1989, SYSTAT, Inc. 







ee ne we Sagoatty wn FS gi Ea ntReS Ss 
‘ 4 reek ape Seve ™ 
: mS nee ee oe oi 


RF . oa” 


BE cet 
- aes ats ee 






* s : . 
“i - . oP . ¥. #T * ie ray 
ri. e . . 


Using dialog boxes 


Setting up 
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Analysis 


You can set up variable bundles to make variable selection easier. What 
you do is define a list of variables to belong to 2 bundle and then choose 
that bundle, Now, anywhere you would see a variable selection list, you 
will only see the variables in that bundle listed. 


You can set up as many as five bundles at a ume. 


e Select Define bundles... from the Goodies menu 

e Choose a bundle: click [Bundle 1], [Bundle 2], or whichever bundle 
you want to define 

e Select the variables that you want to belong to that bundle 

© Choose another bundle and select its variables, etc. 

e Click OK 


Notice that the tall box at the night lists all the variables selected for the 
current bundle. 


(7) STATES 


REGIONS 
REGION 
DIVISIONS 
GIUISION 


® Use ail vars 


© Bundle | 
© Bundle 2 
© Bundle 3 
© Bundle 4 
© Bundle 5 





The current bundle at the time you click OK becomes the current bun- 
dle for dialog boxes. 


Analysis 


Switching 
bundles 
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Using dialog boxes 


Suppose you selected the three character variables, STATES, 
REGIONS, and DIVISIONS, for Bundle 1. Now, when you select 


Stats/Tables..., for instance, you see those variables in the selecuon 
list: 


STATES 
REGIONS 
DIVISIONS 





If you don’t select any variables for a procedure, the usual defaults are 
either to use the first numeric variable in the file or all numeric variables 
in the file. If, however, you are a using a bundle, the defaults are either 
the first numeric variable in the bundle or all numeric variables in the 
bundle. In this respect, using variable bundles is a way of selecting sub- 
sets of variables rather than cases. (Select... selects subgroups of cases 
and is discussed in the next section, “Selecting subgroups.”) 


If you ever need to use a varjable that does not belong to the current 
bundle, you can either type its name by hand in the selection box or 


switch bundles. 


You can switch bundles two ways. Either go back to Goodies/Define 
bundles..., choose a bundle, and click OK, or use the bundles icon in 


any dialog box. 


The bundles icon in the upper left corner of any dialog box lets you 
change bundles. Press the mouse button and hold it on the icon. A list 
pops up. Select the bundle you want, or select [All] to use all the vari- 


ables in the file. 
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Using dialog boxes Analysis 


Keyboard shortcut 
Another way to switch bundles from inside a dialog box is to hold the 


Command key (9) and type a bundle number. Type Command-0 (zero) 
to use all variables. 


Saving bundles 
If you save a DLOG file (as discussed above), your bundles will be saved. 


: a 7 
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Analyzing subgroups 





By groups... You can analyze subgroups with most SYSTAT procedures with 
Data/By groups... 


e Select By groups... from the Data menu 
e Select the grouping variable(s) 
e Click OK 


STATES 
REGIONS 
REGION 
DIVISIONS 
DIVISION 


[] By groups off 


[j Exclude missing 
(] N@w page 





You may specify up to ten grouping variables. Your file must be sorted 
hierarchically on those variables. SYSTAT creates a new subgroup every 
time it encounters a new value on any grouping variable. Procedures 
perform analyses separately for each group. 


See the Data Editor chapter under the “Special data structures” for in- 
formation about grouping variables. 


[By groups off] 
Your grouping variables stay in effect unul you quit from SYSTAT or 


open a new data file. You can tum them off by selecung By groups... 
again and clicking [By groups off]. 
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Using dialog boxes Analysis 


[New page] 
The [New page] option causes SYSTAT to begin output for each by 
group on a new page for printed output. 


[Exclude missing] 
You can check the {Exclude missing] box to keep SYSTAT from analyz- 
ing cases that have missing values for grouping variables. 


Select... You can select a subgroup of cases from a file with Select... 


; e Select Select... from the Data menu 

* ¢ Build a test expression by selecting variables and typing as neces- 
* sary; use the [And] button for logical AND statements 

e Click OK 


Specify each condition as a numeric variable name followed by a relation 
followed by a number. The six possible logical relations are =, <>, <, >, 
<=, and >=. You may also test for character variables having a certain 
character value. Character values must be surrounded by single or dou- 
ble quotation marks. Character comparisons are case sensitive. 
















Use whatever combination of selecting, typing, and mousing necessary 
to build the test condition. If you need to use and, build one condition, 
click [And], build the next, etc. Each completed condition is moved to 
the large box in the lower right corner. 


STATES 
REGIONS 
REGION 
DIVISIONS 
DIVISION 
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Analysis Analyzing subgroups 


You may specify up to ten conditions in a Select... statement. SYSTAT 
analyzes only those cases which meet ai! the conditions (1.e. you can use 
the logical [And], but [Or] is not available). 


Turning selections off 
Your selection stays in effect until you quit SYSTAT or open a new data 
fle. To turn off the subgroup selection, select Select... again, click 
(Clear] to clear the test conditions from the box(es), and click OK. 


With procedures that can save SYSTAT files (such as MGLH), you can ; 
use Select... to compute a model for some cases and then apply the , 
model to other cases. For example, you can compute a regression on one 

set of cases and then use the estimated model to predict values on other 

cases. Just Select... a value on some variable for those cases to include 

in the model. 


Other ways to The Select... item in the Editor menu is a more powerful version of 

select subgroups  Data/Select... First, Editor/Select... provides both [And] and [Or]. 
Second, if you select cases using Editor/Select..., you can later extract 
the cases to a separate data file using Editor/Extract selected... 


The scatterplot brushing tools let you select cases visually from scatter- 
plots. To use scatterplot brushing tools, you must have both the Editor 
and View windows open, and (obviously) you can only work from a scat- 
terplot. Since the Editor is open, you can also use Extract selected... 


Data/Select... is only preferable to the other methods when your data 
file is bulky, since the Data Editor can be slow for large files. Both other 
methods are in the Editor menu, which is only present when the Editor 
is open. : 


The three ways to select cases are compared in the Data Editor chapter. 
Editor/Select... is discussed in detail in that chapter also. The scat- 
terplot brushing tools are introduced in the Exploratory data analysis 
chapter, later in this volume. 
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Weighting cases 





Weight... 












Weighting replicates the cases in a dataset according to a given weight 
variable. You can use case weighting to “unpack” frequency distribution 
datasets by weighting by a COUNT variable. (Frequency distributions 
are discussed in the Data Editor chapter in the “Special data structures” 
chapter.) 


Weight... allows you to specify a weighting variable for statistical analy- 
ses. SYSTAT replicates each case the number of times specified by the 
weighting variable. 


e Select Weight... from the Data menu 
e Select a weighting variable 
e Click OK 


Any analyses you do subsequently will use the weighted version of the 
dataset. 


STATES 
REGIONS 
REGION 
DIVIStONS 
DIVISION 
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Analysis Weighting cases 


For each case, SYSTAT truncates the value of the weightung vanable to 
an integer. SYSTAT uses the sum of the weights as the sample size. If 
the value of the weighting variable is less than 1 for a given case, 
SYSTAT excludes that case from analysis. 


Cross validation In some procedure modules, such as MGLH, you can use weights of 
one to identify the cases you wish to use to estimate a model and 
weights of zero for cases for which you only want predictions or classifi- 


cations. 
Saving analyses Statistical modules which can produce SYSTAT files save the un- 
after weighting weighted number of cases after an analysis, i.e. the same number of cases 


in the input file. This way, you can merge results with the original data 
file without additonal work. 


[Weights off] 
Weighting stays in effect until you quit SYSTAT or open a new data 
file. To turn weighting off, select Weight... again and click [Weights 


off] and OK. 
Non-integer This method of weighting is different from some programs that allow 
weighting weighting of variables but do not change the total sample size. Often, 


this type of weighting involves non-integer weights. To do this type of 
weighting in SYSTAT, use DATA to muluply the data by the weightng 
variable. This is the type of weighting done with the SAS WEIGHT 
option (as opposed to FREQ). The DATA facilites are discussed in de- 
tail in the DATA volume, but here is a brief example: 


DATA 


USE CAPITAL 

SAVE WCAPITAL 

LET INCOME=INCOME*ABILITY 
RUN 


The example weights INCOME by the product of INCOME and 
ABILITY and saves the results in the file WCAPITAL. 
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Graphing 





Sizing and positioning graphics 
The ruler tool 
Positioning graphs by mouse 
Resizing by mouse 
Locating by typing 
Resizing by typing 
Example 
Resize box 
Moving graphs 
Resizing graphs 
Resizing example 
Commands 


Stacking and overlaying 
Overlay, stack, and next 
Overlay plot 
Stack plot 
Next plot 
Movie... 
Time series movie example 


Options 
Color... 
Formats... 
Global settings... 


Drawing tools 
Plot tools and colors... 
- Rectangle 
Rounded-corner rectangle 
Oval 
Circle 
Line 
Text 
Pointer 
Colors 
Hints 
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Graphing 
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Overview 





The first section of this chapter is about “Sizing and positioning graph- 
ics” in the View window. You can control the size and location of graphs 
in the View window by using the ruler option in every Graph menu dia- 
log box. You can also move and resize graphs once they are drawn. 


The next section discusses “Stacking and overlaying graphs” in the View 
window. Whenever you already have a graph in the View window and 
you ask SYGRAPH to plot another, you have a choice of laying the new 
one over the previous one, or scrolling the first one up in the window 
and drawing the new one on a new “page” in the window, or clearing 
out the View window and drawing the new one. 


The “Opuons” section introduces some global settings for graphs that 
are discussed in detail in the Common options of the SYGRAPH 
manual. Here, we just introduce them so you know what the possibilities 
are; you can skip ahead to SYGRAPH manual whenever you’re ready. 


The “Drawing tools” section introduces SYGRAPH’s new MacDraw- 
like tools for preparing presentation graphics. SYGRAPH’s Plot tools 
and colors... are object-oriented text and drawing tools that let you add 
shapes, lines, and text in full color. 
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Sizing and positioning graphics 


The ruler tool Every plot dialog box has a ruler icon alongside its palette of options. 
When you click this ruler, a page preview-style dialog appears. This dia- 
log box lets you adjust the height, width, and position of the graph-to- 
be visually or by typing measurements. 


e Click the ruler icon 
e Specify size and position visually or by typing numbers 
® Click OK 


Height [2.50 ]in  ¥ origin [ 15.0] 7% 
width | 2.50]in vorigin [ 8.5 |% 


LJ Stiaiw graph 





The dialog box shows a marquee representing the size and location in 

the View window of the graph that will be drawn. The marquee shows 
only the graph size—it does not contain any scales, labels, utles, and so 
on. The graph is strictly the axes and drawn portions of a graph. 


The gray box in the lower nght comer of the marquee Is a resize box. 
You can drag the resize box to change the size and shape of the marquee 


(and thus the graph). 
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Sizing and positioning graphics Graphing 


Positioning 
graphs by mouse 






As you can see, SYGRAPH’s default is to place a two and one-half inch. 
square graph in upper center of the View window. The rulers along the 
top and side show the current size of the View window. If you change 
the size of the View window, the ruler dialog box changes accordingly. 


[Show graph] 


If you already have a plot on display in the View window, you can click 
[Show graph] to see a sketch of the plot in the screen. This enables you 
to position subsequent graphs relative to previous ones easily. With 
[Show graph] you can ensure that overlayed graphs and stacked graphs 
will align neatly. 


To change the location in the View window of a subsequent graph, po- 
sidon the pointer inside the marquee. Notice that the pointer is now 
represented by a hand. Press the mouse button, drag the marquee to its 
new positon, and release the button. 





Height [ 2.50 Jin H origin [ -8.3 |% 


2 height {fin Lj Straw graph 
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Graphing Sizing and positioning graphics 


Resizing by To change the size and shape of the marquee with the mouse, position 
mouse the pointer over the gray resize box in the lower right corner of the 
a marquee. Press the mouse button, drag the resize box to another loca- 


ton, and release the button. 


Height 1.88 jin 4 origin [ -8.3]% 
[ 3.31 Jin Y origin | 9.5|% 


CJ) Straw graph 





Positioning by You can position a graph in the View window by typing coordinates for 
typing the [X origin] and [Y origin] options. The coordinates you specify de- 
termine the location of the lower /eft corner of the graph. 


Nouce that the coordinates change automatically when you use the 
hand to drag the marquee. 


1 
} 
i 










The units are percentage of the total current size of the View window, 
and the coordinates measure the distance up and over from the spot 
near the lower left corner of the View window that SYGRAPH consid- 
ers to be (0,0). You can find where (0,0) is for a View window of a par- 
ticular size either by moving the marquee around and watching the co- 
ordinates or by specifying 0 for [X origin] and [Y origin]. 


Resizing by You can specify graph size in inches by typing values in the [Height] and 
typing [Width] boxes. You can also specify a [Z height] for 3-D plots. 
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sizing and positioning graphics Graphing 


Example 






We are going to use the ruler to place to density plots in the View win- 
dow: a fuzzygram of average WINTER temperatures from the US 
dataset, and a frequency polygon of the same variable. We want to stack 
them vertically so we can compare the information that the two plots 


show. 


First, we must resize the View window so that it will be large enough to 
accommodate two 2" by 3" plots. If the View window is not already 
open, select Show view window from the Goodies menu. 


Now, use the resize box in the lower right comer of the window to 
make the window about 4 inches wide and 6 inches tall. (If you have a 
smaller screen, just adjust all these sizes as needed.) Now we're ready. 


Select Density/Fuzzy from the Graph menu 
* Select a variable: WINTER 
@ Click the ruler icon 
e Use the resize box to make the marquee 2 inches tall and 3 inches 
wide (watch the values in the [Height] and [Width] boxes 
e Use the hand to move the marquee to the top, center part of the 


screen 
® Click OK 
® Click OK 





Height [2.00 ]in x origin [-7.5 |% 
width [3.0q]in origin [ 47.8] 7 


Zhe 
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Sizing and positioning graphics 


SYGRAPH draws the fuzzygram. If your fuzzygram doesn’t come out 
exactly where you wanted it, pick it up and move it in the view window. 
(Instructions for that appear in the next section, “The resize box.”) 
When you're satisfied, plot a frequency polygon in the lower part of the 
window. 


e Select Density/Polygon from the Graph menu 
e Select a variable: WINTER 
@ Click the ruler icon 
e Use the resize box to make the marquee 2 inches tall and 3 inches 
wide (watch the values in the [Height] and [Width] boxes 
e Use the hand to move the marquee to the bottom, center part of 
the screen 
e Click OK 
e Click OK 


Note that in this step you can check the [Show graph] box to make sure 
you are positioning the next graph correctly. But don’t worry—if you 
are a little off, you can move the graph in the View window to fix it. 


Height [2.00 ]in origin [ -7.5 ]% 
width [3.00 |in Yorigin [-10.1]% 


2 height | jin [] Show graph 
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Sizing and positioning graphics Graphing 


Before SYGRAPH draws the next plot, it displays an Overlay plot/Stack 
plot/Next plot. We discuss this dialog box in the second part of the 
chapter, “Stacking and overlaying.” For now, just click [Overlay plot]; 
this means that you want the graph to appear on the same page as the 
fuzzygram. 


If you are not satisfied with the position of the plot, just pick it up and 
move it. Here’s how our View window looked when we had gotten ev- 
erything adjusted to our satisfaction: 


SYSTAT View 
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Resize box 


Moving graphs 


Resizing graphs 


Resizing example 
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Sizing and positioning graphics 


After a graph is plotted in the View window, you can move it and resize 
it visually. (If you have a scatterplot brushing tool selected, first click it 
off before attempting to move or resize the graph. ) 


To move a graph, just position the pointer somewhere inside the graph. 
When you press the mouse button, a marquee appears around the 
graph. The graph (everything inside the marquee) is an ofject, and you 
can move the entire object. You can read more about objects in the 
“Drawing tools” part of this chapter. 


Thus, to move the graph, just press the mouse button inside the graph, 
drag the graph to a new location, and release the mouse button. 


The light gray box in the lower nght corner of the graph marquee is a 

resize box. It works the same way as the resize box you saw in the ruler 
dialog box. Just press the mouse button on the box, drag it to a new lo- 
cation, and release the button. The graph is resized accordingly. 


Here we draw a box plot with SYGRAPH’s default size and then resize 
itin the View window. We are using the US dataset. 


e Select Box from the Graph menu 

e Select a variable: WINTER 

e Click OK 

After the plot is drawn, click on it to show the marquee and resize box. 
Then, drag the resize box to a new location. 


SERS SSG HGE SCE TES ECE TERE 
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Sizing and positioning graphics Graphing 


The result is a box plot that looks like a Cinemascope movie on TV. 


POS PAOR TRATES AAT HAGE RS SASH TESESHESHSAAEHESESCSSOESSAA SAS ASHSASE SHLD 


Commands If you choose to use the optional command interface for SYGRAPH, 


you can use the HEIGHT and WIDTH (and ZHEIGHT) commands 
to adjust graph size. See Appendix I: Command reference in the 
SYGRAPH volume for information about SYGRAPH commands. 
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Stacking and overlaying 


Overlay, stack, | You might want to superimpose one graph onto another or place two 

and next graphs side by side to compare them. Whenever there is a graph in the 
View window and you ask for another graph, SYGRAPH displays the 
following dialog box: 


Overlay plot 


Stack plot 





Overlay plot If you want to superimpose a new graph onto the one already in the 
View window, click [Overlay plot]. SYGRAPH draws the next plot di- 
rectly on top of the current plot. If you want the next graph in some 
other location, use the ruler to trace the new size and position before 
clicking OK to start drawing the graph. 


See the “Sizing and positioning graphics” section above for an example 
of overlaying. 


Stack plot {Stack plot] scrolls the current graph upward in the View window before 
drawing the next plot. The vertical scroll bar becomes active, and you 
can scroll up and down through all the plots that the window accumu- 
lates. Once you have stacked plots in the window, you can use 
Goodies/Movie... to view them in quick succession. 


If you submit a batch file of commands (see the Command reference 
in the SYGRAPH manual) that draws several graphs, SYGRAPH auto- 
matically stacks them; you don’t need to click the [Stack plot} button 
yourself. If you want to overlay graphs, place them berween BEGIN and 
END statements. _ 
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Stacking and overlaying Graphing 


Next plot [Next plot] clears all previous graphs from the View window before 
drawing the next. If you have stacked plots, aif the plots are removed. 
The vertical scroll bar is disabled since you have only a single plot in the 
View window. 


Movie... You can view a stack of graphs in quick succession for a movie-like ef- 
fect. Movies are especially useful for monitoring the effects of a series of 
transformation, extractions, smoothings, or weighting. Astronomers use 

Pe ‘ é . 
a “flicker box” to reveal objects that move slightly from time to time. 
You can do this with two scatterplots in a Movie. 


Display a plot 

Do another plot, and use [Stack plot] to scroll the previous plot up- 
ward in the View window 

Continue stacking 

Select Movie... from the Goodies menu 

Specify the wait period between graphs in tenths of a second and 
click OK 


Specify the wait period between cycles in tenths of a second and 
click OK 


© 














To stop the movie, press the mouse button and hold it briefly, until the 
movie stops. 


Enter time between frames Enter time between cycles 


Tenths of a second between frames, 


Tenths of o second between cycles, 


Zero for no wait [ 


zero for no wait [| 





You can resume stacking graphs after viewing a Movie, and you can do 
another Movie any time. You do not lose the graphs in the View win- 
dow untl you close the window or click [Next plot]. Movie... does not 
clear the window. 


Low or high resolution? 
You can do movies with high resolution (PostScript) graphs (use 
Options/Global settings...), but since high resolution graphs draw 
quite slowly, we do not recommend it. 
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Graphing Stacking and overlaying 


Time series In the Series chapter of the SYSTAT manual, we discuss the 4253H fil- 
movie example ter of Tukey. 


e Running median smoother, window 4 
e Running median smoother, window 2 
e Running median smoother, window 5 
® Running median smoother, window 3 
e¢ Running means smoother, window 3, weights 1, 2, 1 


A movie can show beautifully how the smoother gradually tames the 
series. Since we can’t print a Movie in the manual, you'll have to try this 
yourself. We’re using the AIRCRAFT dataset. After each smoothing, 


we'll do a time series plot. 


First, here’s a plot of the original series: 


e Select Series/Case plot... from the Stats menu 
e Select the variable: FLUTTER 

e Set [Lag] to 64 

@ Click OK 


== 
« 


(If you had something in your View window already, click [Next plot] to 
start a fresh stack.) | 









-4 
0 10 20 30 40 50 60 70 


CASE 


Now we smooth and plot again. Since the vertical scale values are likely 
to change after we tame the series, we set [Min] and [Max]. This way, all 
of the plots will be on the same scale and will be easily comparable. 
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Stacking and overlaying Graphing 


e Select Series/Smooth... from the Stats menu 
e Select the variable: FLUTTER 

¢ Choose [Median] smoothing 

e Set [Median window=] to 4 

@ Click OK 


e Select Series/Case plot from the Stats menu 


e Set [Min] to —1 and [Max] to 3 
@ Click OK 


We click [Stack plot] and the first smoothing is displayed. 


3 
2 
it 
E 1 
rT 
0 
. -4 
seg 0 10 20 30 40 50 60 70 
ba ; CASE 


Now, we do the next smoothing and plot again. We need only change 
the smoothing window size, since the variable and type of smoothing are 
the same. We don’t need to do anything in the Case plot dialog box 
since we want all the same settings. 


e Select Series/Smooth... from the Stats menu 
e Change the [Median windows] to 2 
® Click OK | | 


Select Series/Case plot from the Stats menu 
Click OK 
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Graphing Stacking and overlaying 


FLUTTER 


0 10 20 30 40 50 60 70 
CASE 


Select Series/Smooth... from the Stats menu 
Change the [Median window=] to 5 
Click OK 


Select Series/Case plot from the Stats menu 
Click OK 


0 10 20 30 40 50 60 70 
CASE 


Select Series/Smocth... from the Stats menu 
Change the [Median windows] to 3 
Click OK 


Select Series/Case plot from the Stats menu 
Click OK 


ae 
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Stacking and overlaying Graphing 


0 10 20 30 40 50 60 70 
CASE 


Now, the final Hanning step. 


e Select Series/Smooth... from the Stats menu 
e Choose [Mean] smoothing, window 3 
e Set [Weight] to 1, 2, 1 


e Click OK 
: e Select Series/Case plot from the Stats menu 
= e Click OK 
# 2 
i = 
“ c 
* : 
Be -4 
Fa O 10 20 30 40 50 60 70 


CASE 
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Graphing Stacking and overlaying 


Were all set: we have the original series and five smoothed versions all 
plotted and stacked in the View window. It’s time for a movie. 


e Select Movie... from the Goodies menu 
e Specify 0 for time between graphs and click OK 
e Specify 10 for time between cycles and click OK 


SYGRAPH should now flip through your stack of graphs, pausing for a 
full second before starting back at the original series. When you have 
seen the Movie enough times, press and hold the mouse button until it 
stops. 


A sextuple-exposure shot of the Movie is the best we can do here... 


a ne 





> 
OS i ek Le ke ee 


Popcorn is optional. 
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Sale ee 


Options 


Color... 


Formats... 


Global 
settings... 
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The Options item in the Graph menu has three items for controlling 
graph features globally—that is, for setting conditions that are in effect 
for all graphs, all the time, always. (Unul you change them or quit.) All 
three items are discussed in greater detail of the Common options 
chapter of the SYGRAPH manual. We merely menton them here so 
you're aware of them. 


With Color... you can choose the color settings to be used for all sub- 
sequent plots. You can choose colors for the “Background,” 
“Foreground,” “Graph,” and “Labels.” Definitions and instructions for 
these appear in the Common options chapter of the SYGRAPH man- 
ual. 


Formats... lets you control placement of plots being overlayed in three- 
dimensional perspective; scaling to expand or reduce plots; color sepa- 
rations; line thickness; and fonts, sizes, and styles. All these a are discussed 
in the Common options chapter. 


Global settings... offers three on-off switches that control the appear- 
ance of graphs. 


[Automatic graph font scaling] automatically resizes text in graphs when 
you resize graphs: if you make the graph bigger, SYGRAPH makes the 
text bigger to match, and the reverse. 


{Graph scales as editable text] lets you control whether the scales (the 
numbers marking axes) can be edited with the Plot tools and colors... 
or whether they are fixed to the graph and unchangeable. 


Finally, [High resolution] controls whether graphs are drawn with 


Quickdraw commands (low resolution) or PostScript commands (high 
resolution). 


157 


Drawing tools 


tools and You can draw and write on standard resolution graphs in the View win- 

colors... dow using SYSTAT’s plot tools. The tools wil! be familiar to users of 
MacDraw and similar object-oriented graphics packages for the 
Macintosh. 


e Display a graph or open a PICT file 

¢ Select Plot tools and colors... from the Goodies menu 
e Click the View window to make it active 

¢ Choose a tool by clicking it 


You may use both scatterplot brushing tools (discussed in the next 
chapter, Exploratory data analysis) and drawing tools on a given 
graph. You.can draw on the graph with any of five tools: rectangle, 
rounded-corner rectangle, circle, oval, and line. You can add text with 
the text tool (A) and edit any of the objects (including the graph itself) 
with the pointer. You can change the color of any object. 





Plot tools are not available with [Hi gh resolution] graphs. Be sure to 
check off that option in Options/Global settings... of the Graph 
menu. 

Terminology 
We have called the Plot tools and colors olject-oriented. What this 
means is that any shape, line, or text that you add is an object that you 
can move, remove, or change as a distinct entity. 


Note that the graph that SYGRAPH draws is also one object, and if you 
checked on the [Graphs scales as editable text] box in Options/Global 
settings..., each scale (the numbers showing the range of axis values) : 
are separate objects also. You can draw, resize, and remove the objects 
SYGRAPH draws the same as you can the objects you add with the plot 

tools, 


The only exception is that you cannot change the colors of the graph 
itself with the tools—you must choose colors with Options/Color... or 

the [Colors] option before plotting. You can change scale colors with the 

tools. 
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eae mac ara. Se ee is sa eee ee 
«ini 
Drawing tools Graphing 
Rectangle To draw a rectangle in the View window, click the rectangle icon in the 


toolbox. Posidon the pointer in one corner of the area where you want 
the rectangle, press the mouse button, drag to the diagonally opposite 
corner of the rectangle, and release the button. 
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Rounded-corner The rounded-corner rectangle works the same way as the rectangle. 








"rectangle Click in one corner and drag to the diagonally opposite corner, 

i 

4 : 

© e zs 

Oval The oval works much like the rectangle and the square, except that the 

rectangular marquee you drag circumscribes the oval. Press the mouse 
button in one corner of the rectangle circumscribing the oval you want, 
drag to the diagonally opposite corner, and release the mouse button. 
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Graphing 


Circle 


Line 


Text 


160 


Drawing tools 


The circle works similarly to the oval, except that SYSTAT does not in- 
scribe the rectangular marquee you drag. Instead, it draws the circle that 
has the height (vertical diameter) of the rectangular marquee you drag. 





To draw a line, click the line tool. Position the pointer at one endpoint 
of the line you want to draw, press the mouse button, drag to the other 
endpoint, and release the button. To add another line segment, press 
the button again at the intersecting endpoint, drag to the next endpoint, 
and release. 





* 


To add text to a graph, click the text tool. Click at the spot in the graph 
where you want the text to start (the left end for horizontal text, the top 
end for vertical text). A dialog box lets you specify text and font. 


e Type the text in the [Text] box 
© Choose a color 
° Choose a style 
© Choose an angle for placement 


© Choose a font and size 
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Drawing tools Graphing 


Po 


{] Plain @ Black © Green © Magenta 
(_) Bold O White © Blue © Yelow 
(J Shadow © Red ©Cyan 


[1] Underline © 0° © 90°O 180° O 270° 


[| Outline Font: 


[_] ttalic aap 
N Helve tic!:2: 


NB site 


New Cent 





To change the content, font, size, or style of any text already present, - 
double-click the text object with the pointer. ‘The text dialog box ap- 
pears and lets you make your changes. 


Scale labels can be altered with plot tools if you check the [Graph scales 
as editable text] box in Options/Global settings... from the Graph 


menu. 


cee gh 


: Pointer You can move graph objects with the pointer. First, click any object to 
| select it. Then you can move the object, resize it, remove it, or change 


its color. 


». 


Moving objects 
To move the object, position the pointer inside the object, press the 
mouse button, drag the object to another location, and release the 
mouse button. 


STE east oe 


Lt 


peta iy ee eae ane 


Resizing objects 
Notice that when you select any non-text object, you see a gray “resize 
oe ; box” in the lower night corner of the marquee. Drag the resize box to 
change the size (and shape) of the object. Resizing with the gray resize 
box is illustrated in the first section of this chapter. 


Removing objects 
To remove a selected object, select Clear from the Edit menu. 
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Graphing 


Colors 


Wine 
Red 
Green 


Blue 
Cyan 
Magenta 
Yellow 





Hints 
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Drawing tools 


To change the color of any object, select a color from the toolbox (with 
the pointer tool) and then click inside the object. Recall that to control 
colors of the graph itself, you must use Options/Color... and/or the 
[Colors] option in the graph’s dialog box. 


Note that you cannot change the color of a graph with the plot tools. 
You must use Options/Color... or the [Colors] option in the graph’s 
dialog box before plotting. You may change the color of graph scales if 
you had checked [Graph scales as editable text] in the Options/Global 
settings... before plotting. 


If you have several objects on top of each other (or inside each other, or 
overlapping), you may have trouble selecung the one you want. This 
happens because your click inside one object may be interpreted as a 
click inside another object “on top” of it. 


Think of objects as solid and layered. When you click at a given point, 
you will get the most “recent” object (i.e. the one that was added last) 
that contains that point. 


To get at the objects “underneath” more recent objects, try clicking in 
different areas of the object. If you sull have trouble, you may need to 
move the more recent objects temporarily out of the way. 


Ow tn - 
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9 Exploratory data analysis 





Straightening scatterplots 166 
SPLOMs 166 
Straightening relationships __ 166 
Outliers 170 

Spinning 171 
Spin 171 

Home 172 
X,Y,Z 173 
Degrees 173 
Labels 173 
Background 173 
Eye 174 

_ Instant graphs 174 
Brushing tools 174 
Tutorial 174 

Scatterplot brushing 182 

Left arrow 183 
Clicking on points ~ 183 
Clicking in white space 184 

Marquee 184 

Lasso ‘ 185 

Subset analyses 185 
Algebraic selection 186 
Turning subset selections off 186 

Extracting subsets 186 


Tutorial 187 
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Overview 


© 1989, SYSTAT, Inc. 


Exploratory data analysis 9 


The integration of SYSTAT and SYGRAPH into a single environment 
is an outstanding tool for exploratory data analysis, or EDA. A field 
pioneered by John Tukey, EDA ts, in his words, “detective work— 
numerical detective work—or counting detective work—or graphical 
detective work.” (Tukey, 1977). 


In EDA, one examines data, particularly through graphing, to uncover 
as much as possible about the relationships lurking within. If one sees a 
direct relationship, one doesn’t stop at a linear regression fit; one tries 
changing the scales to find a straighter plot. If one sees a particularly 
scattered plot, one smooths the value-to-value aberrations to see the 
general trends more clearly. If one sees a curved cluster of points, one 
tries logging and powering scales to straighten and narrow the cluster. 


One works by trial and error, expecting that some trials will produce 
nothing helpful, but also expecting that one technique or another will 
elucidate the relationships among the data. SYGRAPH’s graphing pro- 
cedures, scatterplot brushing, data extraction, and smoothing offer 
much to the explorer. 


This chapter assumes basic familiarity with the program. Please refer to 
earlier chapters and the SYSTAT and SYGRAPH volumes as necessary 
to refresh your memory on the mechanics of the program. The indices 
for each volume may be helpful. Instruction for two new techniques, 3D 
spinning and scatterplot brushing, ave presented in this chapter. 


The chapter concludes with an EDA tutonal. 
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Straightening scatterplots 





SPLOMs In most situations, your first plan of attack should be to look ata 
SPLOM of all your variables. Looking at a SPLOM is a quick way to 
see what relationships might exist among your variables, because you see 
a tiny scatterplot of every possible pairing of the variables you select. 
For exploratory purposes, you may prefer a full SPLOM (with both 
upper- and lower-diagonal plots), since you get the chance to see each 
variable as both dependent and independent. Even though the plots on 
either side of the diagonal differ only by transposition, seeing both 
versions might encourage visual conclusions. 


Look at the shapes of the point clouds in each cell. Are the clouds foot- 
ballish? Are the points ughtly or loosely clumped? Do they line up on 
rows or columns? Does it look like a wet dog shook itself off and left a 
random splatter? 


At the same ume, consider your data. Do your variables record stimuli 
and responses? Different characteristics of a group? Survey results? ‘ 
Frequency distributions or grouped distributions? 


If your predictor variables are categorical (grouping variables), you 
should have points stacking up horizontally or vertically in several lines. 
You may also think of such scatterplots as being like plots of continuous 
variables with slices removed. If both variables are continuous, the points 
will be scattered throughout the space more haphazardly. In either 
case——continuous or categorical—the same principles in searching for 
relationships apply. 3 


i ee 


Straightening To support any directional hypothesis, you ought to be able to fit a e- 

relationships gression model to the data. In other words, if you think Y is some func- 
tion of X, you ought to be able to get the points to straggle along a line 
of some sort. 


For any of this to be relevant, your data must be continuous. If the van- 
ables are categorical (grouping variables), perhaps you should be com- 
paring statistics from group to group. 
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Straightening scatterplots Exploratory data analysis 


The simplest relationship looks like a straight line in a scatterplot. For 
instance, if Y is always 4X, the points line up. The difference between 
successive values (if sorted from smallest to largest) is additive; i.e. if you 
add some value to the first point, you get the second point, and adding 
the value again should give the third point, and so on. The equation of 
the line is Y = »#X + b, where m is the slope of the line (rise over run), 
and b is the y-intercept (the value where the line crosses the y-axis). 


If the line slopes the other way, the slope 7 is negative and the series is 
subtractive; e.g. subtracting something from the first point gives the sec- 
ond, etc. 


Both additive and subtractive slopes are /inear. 


Linear equations have attractive features for statistical modeling, so if 
the data are curved, you generally should try to see whether they can be 
made linear before using ordinary regression. And remember, using lin- 
ear regression on untransformed curved data is absolutely invalid. 





re 
es If the line is a curve, the series may be multiplicative, where one term is 
& a given factor times the previous term. Try logging the variable. Since 
i logarithms reduce muluplication to addition (or division to subtraction), 
} replacing the values with their logs changes the curve to a line. Then, 

: you can express a direct (linear) relationship of one variable to the log of 
a another. 

i 

4 
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Exploratory data analysis Straightening scatterplots 


For example, a plot of MILEAGE against DISPLCMT (from the 
CARS dataset) has a distinctive curvilinear relationship: 


50 


30 


MILEAGE 


20 










10 
O 100 200 300 400 500 


DISPLCMT 


Which variable should you log? While it is preferable to know the an- 
swer before you look at your data, there is nothing wrong with trying 
transformations until the data look linear. 


First, create two new variables called LOGMIL and LOGDIS, which 
are the base ten logarithms of MILEAGE and DISPLCMT, respec- 
tively. You can do this with Math... from the Editor menu by setting 
LOGMIL to LOG(MILEAGE)/LOG(10) and LOGDIS to 
LOG(DISTANCE)/LOG(10). Then do a SPLOM of all four variables: 
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Straightening scatterplots Exploratory data analysis 





MILEAGE versus LOGDIS remains curved while LOGMIL versus 
DISPLCMT is only slightly better. LOGMIL versus LOGDIS shows a 
trace of the original curve but clearly provides the most linear relation- 


ship. 


You might also try some of the data transformations offered by 
Series/Transform.... Again, think about the data and make some edu- 
cated guesses. Logging reduces multiplicative change to additive. 
Demeaning removes height: it moves the whole cloud down to center 
around y=0. “Square” squares each value (or undoes square roots). 

~ Trend,” “Percent Change,” and “Difference” remove slopes, thus 
“untlung” clusters. 


PRY HES 


: ie 2 Se 
ce. {aid 


Logging is not the only transformation to consider for making curved 
relationships linear. Tukey and others have shown that “unbending” 
transformations form a hierarchy: 


3 2 5 ee eee 
eh ph gE Xk sIOCK) 2 ok 6k ows ans 


You may want to try some of these if logging doesn’t unbend enough or 
if it unbends too much. For example, you can examine how inverung 
MILEAGE (creating a new variable GALLONS that is 1/MILEAGE) 


removes even more curvature than logging MILEAGE. 
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Exploratory data analysis Straightening scatterplots 


Outliers Do you have stray points even after, possibly, transforming your data? 


If you have extreme outliers (points that don’t seem to fit in), they’re 
going to distort the scale and thus confuse the picture. Here is a plot 
from the BRAIN dataset of the base 10 log of brain weights versus the 
base 10 log of body weights. Notice that the two points near the top of 
the plot stretch out the scale and hinder us from examining the rest of 


the data. , 
1 
| 6000 ! 
4000 
o 
Zz 
< 
5 
2000 
fe) 
-2 0 2 4 6 
LBODY 


If you see a relationship that might make sense but a few points distort 
the picture, remove the points and look at what remains. This way, you 
can zoom in on more detailed variation. 


To remove outliers or look at subsets of the data, you might want to use 
SYSTAT"s scatterplot brushing and extracting techniques, discussed 
below. 
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Spinning 
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3-D Plot/Spin from the Graph menu lets you rotate three-dimensiona] 
scatterplots on all three axes in real time. Spinning plots in three di- 
mensions often gives you a chance to see your point cloud from un- 
derneath, from behind, from the side—from literally any angle. Such an 
Opportunity can often give you insights into your data that you would 
never get from seeing just one perspective plot. 








Pilot Prob’y | Quantile 
Bef 
EB ie et 


We will use the CARS dataset to show how Spin works. 





¢ Select 3-D Plot/Spin from the Graph menu 
Select the Z variable: MILEAGE 

¢ Select the Y variable: WEIGHT 

¢ Select the X variable: DISPLCMT 

® Click OK 


17] 


Exploratory data analysis spinning 


CARS 
MILEAGE 
HEADROOM 
REARSEAT 
TRUNKSPC 


WEIGHT 


H |DISPLCMT| 





{ 





The 3D Window has a number of tools along the top and side. 
Home Clicking the Home button returns the plot to its original orientation, 


Yan such as you see in the illustration above. You can click back home as 
often as you like. } 
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Spinning Exploratory data analysis 


Fa ew A You can spin the plot on any of the three axes by pressing the arrow 

TH] > buttons on either side of the X, Y, or Z buttons. For as long as you hold 

aly], the mouse button while pointing at an arrow, the graph rotates on the 
corresponding axis. To rotate in the other direction, use the opposite ar- 


4 [2}>, row. 


You can switch from axis to axis or from one direction to another as 
much as you want. 


By clicking the middle portion of the button (the actual X, Y, or Z), you 
can remove the corresponding axis line from the plot. 


Use the degrees control to change the rate at which the plot spins. 
Notce that the spinning is accomplished by animation: the plot is 
shifted from one position to the next quickly, creating the illusion of 
‘motion. The default is for the plot to shift ten degrees at a time, but you 
can make it spin more smoothly and slowly by decreasing the shift, or 
faster and jerkier by increasing the shift. 













To decrease the amount of shift, press the leftward arrow button; to in- 
crease, press the rightward arrow. You can set the shift amount to any 
even degree value between 2° and 90°, 


Notice that the three axes are labeled with their variables, MILEAGE, 
WEIGHT, and DISPLCMT. You can remove any or all labels from the 
plot by clicking the X, Y, or Z of the [Labels] option. If you click a label 
off, its letter on the option is dimmed; to restore the label, click it back 





on. 
Background By default, Spin draws a black plot against a white screen. If you prefef, 
y you may have a white plot against a black screen. Just click the 


black/white button to switch back and forth. 


If you use Options/Color... from the Graph menu to set background, 
foreground, graph, and label colors, Spin plots with the colors you set. 
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Exploratory data analysis Spinning 


After you have spun the plot and found what you consider to be the op- 
timum perspective, click the [Eye] button. The X, Y, and Z boxes show 
what settings to use for the [Eye] option in Options/Formats... so that 
you can print a final, high resolution 3-D Plot for presentation. 


ls i: Alle i ee =, = - 


If you, like many people, find the [Eye] option in Options/Formats... 
confusing, just stick with Spin. After you’ve found the orientation you 
like, check the [Eye] coordinates, set them, and then whip out a final 
copy with 3-D Plot. 





Instant graphs The arrow button at the far nght leads to a pop-up menu providing in- | 
stant histograms, box plots, or probability plots. After spinning a plot of 
three variables, you might want to see quick histograms—or box plots— 
or probability plots—of each variable. Rather than having to go to the 
Graph menu and play with a bunch of dialog boxes, you can just use the 
pop-up menu. Point at the arrow and press the mouse button. A little | 
menu pops up; select the item you want and release the mouse button. } 


Cerrerr of DISPLCMT WEIGHT MILEAGE 
Box plots of DISPLCMT WEIGHT MILEAGE 
Probability plots of DISPLCMT WEIGHT MILEAGE 









SYGRAPH does the rest. All you do is decide whether to stack or over- 
lay the graphs in the View window. 


Brushing tools There are three scatterplot brushing tools available if the Data Editor is 

| open when you use Spin. These tools work the same as the brushing : 
tools in the regular View window. See the “Scatterplot brushing” sec- 
tion, next, for details. 





Tutorial Above we showed the orginal orientation of our MILEAGE by 
WEIGHT by DISPLCMT plot. No, you don’t have to flip back to it. 
Here it is again: 
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Spinning Exploratory data analysis 


We often orient plots a certain way to suggest causality: in an X-Y scat- 
terplot, we plot the predictor as X and the response as Y. In an X-Y-Z 
scatterplot, we might have two predictors as X and Y and a response 
variable as Z. This custom often leads us to infer a causal relanonship 
when we see a 3-D plot. We look for a way to interpret the vertical vari- 
able as some function of the “floor” variables, X and Y, 


In fact, the axis assignments are as likely as not to be arbitrary. This ex- 
ample ts a good case in point. Most of us know that fuel efficiency, ve- 
hicle weight, and engine displacement are al] related characteristics, so 
we chose those variables for the example. We then asked you to plot 
those three variables against each other. Since we didn’t have any pet 
theory to bang you over the head with, we su ggested an order arbitrar- 


ily. 


So let’s question the axis assignments right away. Use the degrees con- 
trol to set the shift amount up to 90°. Now, click any of the X, Y, or Z 
buttons by turns to flip the plot. This amounts to switching axis assign- 
ments. Here are some flips: 


EIGHT 


oH Rr he 


eh a fey 





oles ae 
* “fe 





You can try some of your own. In the upper nght plot notice that the 
DISPLCMT axis is invisible; this is because it is approximately perpen- 
dicular to the “plane” of the monitor screen or the page of this book. 
When you're spinning the graph, you can see more easily whether the 
axis 1s pointed toward or away from you. 
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Expioratory data analysis Spinning 


You've probably already noticed something else. What used to look like 
a ught lictle glob of points floating in space is clearly not. In a few of the 
plots above, the point cloud seems to be spreading across a sort of 
curved surface. We can try to see this more clearly by spinning along 
one axis at a ime in smaller increments. 


So, use the degrees control to set the shift back down to something 
small—say, 6°. To make it easier for you to follow along, we’ll also click 
Home to start fresh again. 


Now, press the mouse button on the leftward X arrow. What we’re go- 
ing to do is make MILEAGE perpendicular to the screen so we can look 
at WEIGHT and DISPLCMT. As it spins, you can see the following 


come into view: 
LE AGE 
t ISPLCMT 


EIGHT : 


We see an almost linear relationship between WEIGHT and 
DISPLCMT. That makes sense: bigger cars have bigger engines. We 
might want to try some transformations later to see if we can straighten 
it up a bit. Notice that the points fan out somewhat as they go away 
from the vertex of the three axes. We might also try to bring that under 
control. 


Now, let’s look at MILEAGE and DISPLCMT. Click Home again, and 
aim for this orientation. We used the leftward Z, the leftward X, and the 
leftward Y axes each briefly, in that order; you could get a similar orien- 
tation with a different sequence of moves. 
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Spinning Exploratory Oata analysis 


Anyway, MILEAGE and DISPLCMT are not so easy to interpret. 
First, the clump is “upside-down;” we generally like to see relationships 
that are more like Y=X. We could fix that problem by using the recipro- 
cal of MILEAGE, GALLONS, like we did earlier. 


Also, notice that the points are scattered more randomly than we might 
hope. We might try transforming the variables to find a more linear 
shape. 


Again, what we see makes sense. Larger engines (which we might casu- 
ally associate with bigger cars) generally are less fuel efficient, but there 
are big cars that are efficient and little cars that are not. Other factors 
are involved: actual car size (WEIGHT), number of cylinders, type of 
transmission, and so on. 


Finally, let’s look at MILEAGE and WEIGHT. We'll let you figure out 
how to spin to an orientation like the one below. 


a. oe eet 


- . 
ote Me 
. 





Tee. 2: 


We see the same problem as before: we’d rather have the inverse of 


MILEAGE. 
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Exploratory data analysis Spinning 


By now you're probably wondering why we didn’t just look at a SPLOM 
if we wanted to see each pairing. Good question. Let’s do a SPLOM 
now. Since we’re exploring, we'll draw a full symmetric SPLOM. 


RILERGE 





This is unquestionably more efficient. But what we don’t see in the © 
SPLOM are angles “above” and “below” and “beside” the cloud of 
points. You know a shape better when you see it from all sides. When 
you were spinning, perhaps you noticed some views like these: 
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Spinning Exploratory data analysis 


ISPLOMT 





ISPLOMT 
o° TILE AGE 

i Another thing we don’t see in the SPLOM is the motion. Sometimes it 
: is the spinning process itself that is most enlightening: by seeing a two- 
all dimensional representation of three dimensions “in motion” we are 
$ better able to construct a three-dimensional image cognitively. (It is 
. easier for most people to perceive depth from a motion picture than 
. from a snapshot.) 


If you want to mislead your audience, you might find a view like this 
useful: 
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Exploratory data analysis Spinning 


Here, we clicked off one axis label that was in the way: 


ISPLCMT 





PILE AGE 


Here are some views without the axes. Removing the axes might free 
you to visualize some other axes, such as might be produced by principal 
component analysis. (In fact, you might want to try spinning your 
Factor results.) 
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Spinning Exploratory data analy sic 


You might look for the orientations that maximize or minimize spread 
along one (computed or visualized) component. 
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Scatterplot brushing 





Scatterplot brushing allows you to isolate data points visually. These 
scatterplot brushing tools appear in the upper left corner of the View 
window any time you draw a scatterplot or a SPLOM with the Data 
Editor open. 


ee ee ee re 





You can use the left arrow to select a point and see which case generated 
it by looking in the Data Editor, or to select one or more cases in the 
Data Editor and look in the plot to see the point(s). You can use the 
lasso or marquee to select a group of points. You can then do analyses ! 
on just those cases, or save those cases in a new file for analysis later. ‘| 


a 


This section gives instruction for each of the tools. The final section, 
“Tutorial” is a walk-through demonstration of how to use scatterplot 
brushing on the BRAIN dataset that compares brain weights and body 
weights of a variety of animals (from tny rodents to huge dinosaurs). 


ee a ee 


Problems? | 
I don’t see the brushing tools. 


1) Be sure to open the file in the Data Editor. The brushing tools only 
work when the Data Editor is open. 


me. tee 


2) The brushing tools disappear whenever you delete a row or case from 
the Data Editor, change or delete a data point, resize the View window, 
or resize or relocate the graph. 
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Scatterplot brushing Exploratory data analysis 


I can’t use the resize box or move the graph. You need the regular 

pointer cursor to move or resize graphs. Just click off the currently se- 
lected tool, and you have the pointer. Remember, though, if you resize 
or move the graph, the brushing tools disappear. Position and size the 

graph before plotting with the ruler option. 


Left arrow The left arrow has two funcnons. First, you can click on a point in a plot 

or SPLOM to see which case generated it and, in the case of a SPLOM, 
which points in the other plots of the SPLOM were generated by the 
same case. Second, you can click in white space of the View window to 
highlight the points generated by the cases that are selected in the Data 


Editor. 
Clicking on You can click a point in the plot to highlight its case in the Data Editor. 
points If you have a SPLOM, the points generated by that case in other plots 
of the SPLOM are highlighted for as long as you hold the mouse 
button. 


e Be sure the View window is active 
e Choose the left arrow tool 
@ Click on a point 


Now, look in the Data Editor. The case that generated the point is 
highlighted. 


Thiet Re BEA. Sooke B ; s ee 


You may want to move the View and Editor windows so that you can 
see the Data Editor clearly even when the View window is the active 
window. 





_ Problems? 
I don’t see any highlighted cases. 


Tee ' Ba 8 

-T ‘kt 4 < . - a 
P ? aT) % aL 3 
ba \ ice : 


1) Be sure that you clicked om a point and not near it. 


2) If you have several points clustered closely together, you may have to 
try a couple times to get the one you want. If it is too difficult to isolate 
a point, try selecting a group (discussed below) or adjusting [X min], 

[X max], etc. with the [Axes] option and replotting to “zoom in” on the 

area you re interested in. 


3) If the point came from a case that is not currently visible in the Editor 
window, you may need to scroll up or down. 
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Exploratory data analysis scatterplot brushing 


4) If you had selected any cells in the Editor before using the left arrow, 
the case is highlighted in that column—the “current” column. Decide 
which variable column you would like the left arrow to highlight, click 
somewhere in the column to make it the “current” column, and try 
again. 


Clicking in white The second function of the left arrow is to see in the plot what cases are 
space currently highlighted or bulleted. 


e Highlight a case by using the left arrow, bullet points with the mar- 
quee or lasso, or select cases in the Data Editor 

e Choose the left arrow tool 7 

e Press the mouse button anywhere in the View window except ona 
point—that is, click on white space 

e When you are done studying the highlighted points, release the 
mouse button 


If you want to see the points for a longer time but don’t want to hold 
onto the mouse button, hold the Option key and click the left arrow. 
The points will blink on and off unul you click again. 


Problems? 
I selected a bunch of points, but only one point is highlighted. Be 
sure that you are clicking on white space. If you click on a point, only 
that point (or that point in all plots of a SPLOM) is highlighted. Click 
outside the plot to be sure. 


The wrong points (or no points) are highlighted. As we discuss be- 
low, you “deselect” points by clicking the lasso or marquee in the View 
window. If you clicked in the window before choosing the left arrow 
tool, you may inadvertently have turned off or changed the selection. 
Also, be sure you are not clicking the left arrow on a point. 


Marquee Use the marquee to make rectangular selections of points. Any points | 
you select in the plot are marked with round bullets (e) in the Case col- 
umn of the Data Editor. 


e Choose the marquee tool 

e Press the mouse button and drag a rectangular marquee around the 
points you want 

e Release the mouse button 

e Look in the Data Editor to see the bulleted cases 


¢ 
i 
: 
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Scatterplot brushing Exploratory data analysis 


You can use the Shift key to make cumulative selections. Make one se- 
lection the usual way, then hold the Shift key while making another. 
Both sets of points are highlighted. 


Once cases are bulleted in the Editor, any procedures you perform are 
limited to those cases. So, you can use the marquee to select a subset of 
cases, do Stats and Graph analyses of the subset, and then return to the 
full dataset. More on this under “Subset analyses.” 


To turn off a selection (bulleting), just click the marquee anywhere in 


the View window. 
Lasso Use the lasso to rope irregularly-shaped clusters of data points. Any 
points you select in the plot are marked with round bullets (¢) in the 


Case column of the Data Editor. 


e Choose the lasso tool 

e Press the mouse button and draw a rope around the points you want 
® Release the mouse button 

e Look in the Data Editor to see the bulleted cases 


If you want to isolate several remote clusters, either draw a ticky rope 
that doesn’t capture extra points or make cumulative selections. 


To make cumulative selections, make one selection the usual way, then 
hold the Shift key while making another. Both sets of points are high- 
lighted. 


Once cases are bulleted in the Editor, any procedures you perform are 
limited to those cases. So, you can use the marquee to select a subset of 
cases, do Stats and Graph analyses of the subset, and then return to the 
full dataset. More on this under “Subset analyses.” 


To turn off a selection (bulleting), just click the marquee anywhere in 
the View window. 





Subset analyses Once cases are bulleted in the Data Editor, you can do further analyses 
with just those cases. For instance, you could use Corr to find the 
Pearson correlation coefficients of just those cases; thus, you could remove 
the effects of outliers on the correlation of an otherwise convincing 
cluster. 
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Exploratory data analysis Scatterplot brushing 


Algebraic 
selection 


Turning subset 
selections off 


Extracting 
subsets 
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Or, you could use Plot again and see just those points more closely: 
since you reduce the range of values, the scale becomes smaller. You’re 
effectively “zooming in” on that part of the graph. You could then doa 
further selection by using the marquee or lasso on that plot to narrow 
the field sull more. 


Editor/Select... lets you to select cases algebraically. This is discussed 
in the Data Editor chapter. 


Any bullet selection remains in effect unul you make a different selec- 
tion, click the marquee of lasso in the View window to turn the selection 
off, select Remove select from the Editor menu, open a new file, 
transform a case, enter a new value in the Editor, or delete a case. 


You can extract bulleted cases into a separate file with Extract... from 
the Editor menu. 


e Use the marquee, lasso, or Editor/Select... to select cases 
e Select Extract... from the Editor menu 

e Specify an output filename and location 

® Click Save 


All the bulleted cases are placed in the new file. You can then open that 
file and do further analyses on just those cases. 
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Tutorial 


Perhaps the best way to introduce scatterplot brushing is with an exam- 
ple. The BRAIN file is a sample dataset recording the body weights (in 
kilograms) and brain weights (in grams) of various species. 


Before we start, we know certain things. Dinosaurs and elephants are 
big. Very big. Rats are not. But are dinosaur brains as large, propor- 
tionally, as elephant brains? Are rat brains as small as they would be if 
Tats were uny dinosaurs? 


Let’s look at a scatterplot of BRAIN G against BODY KG: 
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Exploratory data analysis Tutorial 


Right away, we know a few things. First, we have three points that are 
way out of range. Choose the left arrow tool and click the topmost out- 
lier, then click on the Data Editor to make it active. 


SYSTAT Vier: . 
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We see that this huge-brain critter is the African elephant. Clicking on 
the other two outliers reveals a huge-brained Asian elephant and a big, 
stupid Brachiosaurus. All three are out of range similar distances 
(visually), but note that elephants have disproportionately large brains 
but body sizes fairly well in range. The Brachiosaurus, contrarily, has a 
huge body but an embarrassingly small brain. 


The remaining points represent animals whose brain/body weight ratios 
are closer to the average. However, nouce that we see only about a 
dozen points on the plot, yet we have a whole bunch of cases. We must 
conclude that the lower corner of the graph has points stacked on top of 
each other. 


Choose the marquee tool and pull a marquee around the lower clump. 
After a pause, the selected points blink and their cases are bulleted. Any 
analysis we do now will use only those bulleted cases. 
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Tutorial Exploratory data analysis 
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Use Plot again. (It might be easier to use Redo last analysis, abbrevi- 
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Exploratory data analysis Tutorial 


We sull have a ught concentration in the lower left commer and a bunch 
of brainy or overgrown outlying animals. Again using the left arrow 
tool, we see that humans, horses, giraffes, chimpanzees, donkeys, goril- 
las, and cows are disproportionately large brained. The latter five points 
are clumped closely together—you may need to fuss a little bit and point 
carefully to find out all the mammal types where the points stick to- 
gether. Or, you could use the command key (38) and the marquee or the 
lasso, and then look at the cases that are bulleted for extraction. 


By now, though, you should have made an important observation: the 
data are ughtly scrunched together near the origin and fan out increas- 
ingly as BRAIN_G and BODY_KG increase. In fact, the increasing 
amount of space looks almost—could it be?—multiplicative in both di- 
recuons! 


You should be thinking, “What if I take the logarithms of the vari- 
ables?” 


What, indeed? Try it. Use Editor/Math... to create new variables 
LBRAIN and LBODY: 


set LBRAIN to LOG(BRAIN_G)/LOG(10) 
Set LBODY to LOG(BODY_KG)/LOG(10). 


Now, look at a SPLOM of all four variables: BRAIN_G, BODY_KG, 
LBRAIN, and LBODY. Recall that a SPLOM is almost always the best 
bet when you’re beginning any analysis, because it shows you every 
possible pairing of variables. If you didn’t use a SPLOM, you might 
start with a plot of LBRAIN and BODY_KG. As you can see in the 
SPLOM, even logging just BRAIN_G cleans up the plot a lot, and you 
might be content to stop there. The SPLOM, however, makes it clear 
that logging both variables lends the clearest relationship. 
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Exploratory data analysis 





Now we plot BRAIN_G against BODY_KG using the [LOG] option to 
log both variables automatically. 


e Select Plot/Plot from the Graph menu 

¢ Select the Y variable: BRAIN_G 

e Select the X variable: BODY_KG 

° Click [Log] and specify 10 for [X log] and [Y log] 
e Click OK 
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Exploratory data analysis Tutorial 
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We see how scatterplot brushing and extracung can help us see relation- 
ships we might not have guessed otherwise. Once we extracted a few 
outliers, the plot got enough clarity that we could guess more easily thar 


logging might be helpful. 


& 
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At this point, you might want to continue with some more brushing and 
extracting. You’!] notice that the dinosaurs are sull outliers on the un- 
derside of the average brain-to-body proporton, although certainly not 


to the degree they were before logging the data. 





Make your own decisions: for your purposes, is it valid to narrow the 
field to a few millennia worth of animals? If so, then extract away. Or, 


throw validity by the wayside for now, and make some experimental, 
arbitrary extractions and do some stats—see how your results change. 
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Tips and shortcuts 10. 





All the In the Redirecting output chapter, we explained that both the Analysis 
windows are window and the Command window are text editors. You can type in 
text editors them to add your comments. Just place the cursor where you want to 


type something, click the mouse button, and start typing. You can add 
new lines by pressing Return, and you can remove lines and material by 
selecting it with the mouse and pressing Delete. 


(Note that if you are editing in the Command window, you have to be 
careful about prompts; see the Command reference appendices in the 
SYSTAT and SYGRAPH volumes for information about editng com- 
mand batches.) 


You can also Copy from them, Cut from them, and Paste into them. 
You can cut the numbers that some analysis spat out, paste them into 
the Data Editor, and analyze those results. 


If you choose [Scroll analyses] from Data/Formats..., you can let all 
your analysis results pile up in the Analysis window unul you're done 
working. Then, select all the contents of the window (use Select all 
from the Edit menu) and print them and/or save them. You'll have a 
permanent record of your analysis process; if you ever need to recon- 
struct your efforts, it will be easy. 


f | Do you find yourself sticking little notes all over your machine remind- 
ing yourself to double check a data point or rerun some analysis without 
the outlier? Type those notes directly into the Analysis window, and 


they won’t get lost. 
Redo last You can redo your most recent statistical or graphics procedure in one 
analysis step by selecting Redo last analysis from the Goodies menu. Redo re- 


peats the most recent action taken through the Stats or Graph menu. 


For example, after you bullet some points with the scatterplot brushing 
tools, you might want to look at a plot of just those cases. Select Redo 
last analysis, and SYGRAPH automatically draws a new plot, only this 
time it uses only the selected cases, just as it would if you had gone to 

_ the Graph menu yourself. 
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Or, suppose you had done a SPLOM of all the variables in a file and 
then you decided you needed to log a few variables. After you get done 
with Editor/Math..., you’re ready to plot again. Just select Redo last 
analysis. 


In the Analysis chapter we showed how you can save all your dialog box 
settings with the [DLOG] option in the Save as... dialog box. Saving 
dialog boxes—particularly when you’re working on a lot of repetitive 
models with MGLH—can save you a lot of pointing and clicking when 
you go back to work after lunch or the next day or a year later. 


You might want to set up special DLOG folders—one for each dataset 


_ you work with—and include the date in each filename. You can keep a 


dialog box “journal.” 


Also in the Analysis chapter, we discussed variable bundles. Whenever 
you're using a dataset with more than a dozen or so vanables, you will 
probably notice that you use only a handful of variables at a ume. You 
waste a lot of time scrolling through variable selection lists. 


Set up a few bundles of your most frequently needed handfuls of vari- 
ables. Remember, you can switch from one bundle to another or back to 
the whole dataset instantly. i 


If you forget how to run an analysis or produce a graph, you can proba- 
bly find the information you need most quickly with the on-screen help 
facilities. | 


Don’t forget that help is a click away for every dialog box, button, check 
box, selection list, window, statistical term, tool, command,... See the 
Help chapter for details. 





You probably won’t need to use these manuals for weeks at a time after 
you’ve gotten the hang of SYSTAT and SYGRAPH. (We hope you will 
read the manuals, of course—but you shouldn’t have to go digging 
through indexes for instructions on a button!) 








Need to identify an outlier? Extract a subset? Exclude some oudiers? 
Follow some cases across the various plots of a SPLOM? Scatterplot 
brushing tools let you do these things quickly, visually. 
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interface 


Command logs 


Command 
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Instant graphics 
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Tips and shortcuts 


Many of the menu items have command-key abbreviations. Rather than 
having to use your mouse to pull down the menu and select an item, you 
can type its command-key abbreviadon. Command-key abbreviations 
use the Command (Apple or %) key much like the Shift key: hold the 
Command key, press the appropriate letter, and release the Command 
key. Like most Macintosh applications, for instance, you can use 
Command-O (3-O) to open files, #-S to save, etc. 


Command-key abbreviations are shown in the menus along the nght 
side. 


Many people find command interfaces to be more efficient than menu 
interfaces for some types of work. Particularly for repetitive analyses, 
SYSTAT and SYGRAPH’s optional command interface can be helpful. 
See the introductions to the Command references, which appear as 
Appendix I in both the SYSTAT and. SYGRAPH manuals, for more in- 
formation about how and why to use commands. 


A particularly useful feature of the optional command interface is that it 
enables you to keep precise, complete records of every step you took in 
analyzing and graphing your data. Even if you actually use the menus to 
do your work, you can have the Command window open and SYSTAT 
and SYGRAPH will automatically generate a command log of your 
work, At the end of the session, all you need to do is select all the mate- 
rial in the command window and print it or save it. Again, see the 
Command reference appendices for details. 


You can also use the command interface to put together batches of com- 
mands to run all at once—perhaps while you go to lunch. You can put 
batches together easily by editing the command log of a prior session. 
See the Command reference appendices for details. 


After results of statistical procedures are displayed in the Analysis win- 
dow, SYSTAT can suggest some possible next steps—other graphical or 
statistical displays that you might want to check—-and even perform 
them for you. Just hold the Option key to get the magnifying glass cur- 
sor and press the mouse button on any variable name or any cell in a 
matrix. : 
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Tips and shortcuts 


Variable names 


Matrix cells 


Example 
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In any statistical output, if you hold the Option key and press the mouse 
button on a variable name, SYSTAT offers a pop-up menu of graphics 
and statistic that you might want to try. 


This pop-up menu works like a regular Macintosh menu: select the item 
you want and release the mouse button. If you decide you don’t want to 
do any of the suggestions, just move up and off the menu before releas- 
ing the mouse button. 


If you do select an item, SYSTAT does al! the work for you. You don’t 
have to go to the Graph or Stats menus and click your way through a 
dialog box; you just decide whether to overlay or stack graphs. 


In any matrix produced in the Analysis window (such as a matrix of 
Pearson correlation coefficients), you can Option-click any cell to get a 
menu of graphics and statistics for the corresponding pair of variables. 


Again, this menu works like any menu, and SYSTAT does all the work. 


Stats/Statistics... produces the following for the DOCTORS variable 
in the USDATA file: : 





SYSTAT Analysis 





Total observations: 30 
DOCTORS . 
N of cases 30 
Ninimun 345.000 
Max i mum 37225 .000 
Mean 8711.860 
Stendard dev 11148 .827 


Now, if you hold the Option key, the cursor changes to a magnifying 
glass. With the magnifying glass, press the mouse button on the variable 
name DOCTORS. The following menu pops up: 
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Tips and shortcuts 


SYSTAT Analysis 


ad Box plot of DOCTORS 
N of cases | Descriptive statistics of DOCTORS 
BA ees | Frequency counts of DOCTORS 


Max imum 


Mean Histogram of DOCTORS 
ee Sera Probability plot of DOCTORS 


Standard Dev 
Quantile plot of DOCTORS 
Stem-and-leaf plot of DOCTORS 


We release the button on the Probability plot of DOCTORS item 
and get the following plot: 
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Next, we use Corr/Pearson... from the Stats menu to produce a corre- 
lation matrix of ACCIDENT, CARDIO, CANCER, and | 








é PULMONAR. We can Option-click the CARDIO, ACCIDENT cell 
} and get a menu of suggested statistics and graphics for CARDIO and 
4 ACCIDENT: 
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Tips and shortcuts 






SYSTAT Analysis 





Pearson Correlation Matrix 









ACC | DENT CARDIO CANCER PULMONAR 




















ACC i DENT 1,000 
CARDIO Bees minds 
CANCER =e: Box plots of CARDIO and ACCIDENT 






PULMONAR ~0.q Histograms of CARDIO and ACCIDENT 


Influence plot of CARDIO by ACCIDENT 

Plot of CARDIO by ACCIDENT 

Plot of CARDIO by ACCIDENT with regression 
: merece, Probability pilots of CARDIO and RCCIDENT 

Be ee] Regression of CARDIO by ACCIDENT 
Tabulate CARDIG by ACCIDENT 
t-test of CARDIO and ACCIDENT 





ee i Te 












Number of observatians: 








ets he A i ne PO te aime. or 





We select Box plots of CARDIO and ACCIDENT and get the fol- 
lowing box plots, instantly. As with all graphs drawn in the View win- 
dow, you must choose [Overlay plot], [Stack plot], or [Next plot] be- 
tween graphs. 
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If you try to Option-click some other'type of item in the Analysis win- 
dow, SYSTAT reports that it doesn’t know what it is and then offers to 
try Information. 


v0 ogee ath 


SYSTAT Analysis 


Try ‘Information’ on this word? 


N of cases 50 
Minimum 14.000 
Max imum 581.000 
Mean 137.420 
Ver lance 13938 .902 
Standerd Dev 118.063 
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Tips and shortcuts 


Selecung Try information on this word is effectively the same as se- 
lecting the word and selecting Information from the Goodies menu. 
Information gives a brief review of the term or statistic selected. It is 

discussed further in the last section of the Help chapter. 
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Tips and shortcuts 
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ABS function, 69 

Absolute value function, 69 
ACS function, 69 

Active window, 14 
Addition (+), 69 
Aggregated data, 88-90 
[And] (see Select...) 


Analysis, 122-139 


[Analysis results] (see Formats...) 
Analyzing subgroups, 134-36, 185 
[And], 70 | 

Arc hyperbolic tangent function, 69 
Arccosine function, 69 

Arcsine function, 69 

Arctangent function, 69 

Array, 52° 

ASN function, 69 

ATH function, 69 

ATN function, 69 

[Autosave dialogs] (see Formats...) 
AVG functon, 72 


Batches, 197 

Beta distribution, 73 - 

BCF function, 73 

Blocks of data 
Changing, 60 
Selecting, 60 

[Bold] (see Formats...) 

BIF function, 73 

BRN function, 73 

Brushing tools, 174, 182~93 

Bulleted cases, 186 

Bundles, 130—33, 196 
Defining, 131 
Saving, 133 
Switching, 132 





203 


By groups... Cosine function 


By groups..., 134 
[By groups off], 134 
[Exclude missing], 135 
[New page], 134 


[Cancel], 129 

Caret (4) 

Cases, 46 

Case weighting, 137-39 

Cases, maximum number of, 50 

Categorical variables, 166 

CDF (see ZCF) 

Changing blocks of data, 60 

Changing data entries, 59 

Changing variable names, 59 

Character values, 54 

Character variables, 47 

Chi-square distribution, 73 

[Clear], 129 

Clear, 62 

Clipboard (also see Cut, Copy, Paste) 
Copying or cutting data into, 104 
Placing output in, 120 

Close box, 14 

Color..., 157 

Coloring objects, 162 

Command batches, 197 

-Command help, 38-40 

Command key abbreviations, 15, 197 

Command logs, 197 

Command interface, 4, 197 

Copy, 62, 120 

Copy graph, 120 

Comments on output, 115 

[Complex find...] (see Find case...) 

Continuous variable, 166 

[(Condense] (see Formats...) 

[Correlation] (see Formats...) 

COS function, 69 

Cosine function, 69 
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[Covariance] 
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[Enter down] 


[Covariance] (see Formats...) 
Cross validation, 138 
Cumulative functions,, 73 
Cut, 62 


Data Editor vs. Spreadsheet, 50 
Data Editor, 44-66 
Data files, 92-109 
Data set, 146 
Decimal places 
Shown in output, 125 
Shown in Editor, 82 
[Decimals to show] (see Formats...) 
Default, 25 
Define bundles..., 131 
Deleting cases, 60 
Deleting variables, 60 
Descriptive statistics, 24 
Dialog boxes, 128-33 
Keyboard shortcuts, 128 
Options, 129 
Saving setungs, 130 
Selecting functions, 128 
Selecting variables, 128 
Dimension, 52 
[Dissimilarity] (see Formats...) 
Distribution functions, 73 
Division (/, 69 
([DLOG] (see Save as...) 
Documentation, overview of, 9 
Double precision, 96 
Dragging objects, 142-22, 148, 159-61 
Drawing tools, 158-63 
Double precision, 96 


ECF function,73 
Editing data, 59-64 


_EIF function, 73 


[Enter down] (see Formats...) 
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[Enter mght] 
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Formats.../ iCondense) 


[Enter nght} (see Formats...) 
Entering data, 48 
Entering variable names, 51 
Erlang variate, 74 
Equal to (=), 70 
ERN function, 73 
Evaluation of expressions, 76 
[Exclude missing] (see By groups...) 
Exclusive option, 129 
EXP function, 69 
Exploratory data analysis (EDA), 164-93 
Exponential distribution functions, 73 
Exponennal function, 69 
Exponentiation (4), 69 
Exporting data, 104-05 

Copying into the clipboard, 104 

Saving text files, 104 
[Extend] (see Formats...) 
[Extended results], 125 
Extracting subsets of data, 186 
Extract selected... , 84 
Extreme value variate, 74 
Eye, 174 


F distribution, 73 
FCF function, 73 
FIF function, 73 
Filenames, 94 
Fill worksheet..., 76-77 
Filling variables with random data, 77 
Find case..., 78-81 
[Complex find...], 78 
[Test variable], 78 
[Variable or expression], 78 
Find next, 78-81 
Floppy disk installation, 5 
Formats... (Data), 124-126 
[Analysis results], 124 
[Bold], 124 
[Condense], 124 
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Formats.../[Decimals to show] 


[Decimals to show], 125 

[Extend], 124 

[Extended results], 125 

[Graphics], 125 

(Italic}, 124 

(Oucline], 124 

[Menu click], 125 

[Plain], 124 

[Scroll analyses], 126 

[Size], 126 

[Shadow], 124 

[Standard] results, 125 

(Text font], 126 

[Underline], 124 
Formats... (Editor), 81-82 

[Correlation], 82 

{Covariance], 82 

[Dissimilarity], 82 

[Enter down], 82 

-[Enter right], 82 

[Rectangular], 82 

[SSCP], 82 2 
Format of numbers (see Formats...) 
Frequency distributions, 73 
FRN function, 73 
Functions, 68-76 


Gamma distribution, 73 
Gamma variate, 73 
GCF function, 73 
GIF function, 73 
Graph menu, 19-20 
Graph options, 157 
Color..., 157 
Formats..., 157 
Global settings..., 157 
Graphic output, 110-20 
Graphing, 140-163 
Graphs, printing, 116 
Greater than (>), 70 
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Greater than 
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Greater than or equal to Less than 


Greater than or equal to (>=), 70 
GRN function, 73 
Grouping variables, 86-88 


[Height] (see Ruler tool) 
Help, 5, 28-33, 82, 196 
Command, 38-40 
Dialog, 30, 33-34 
Editor help, 37 
General, 30-33 
Instant, 30, 34-37 
Hyperbolic arctangent, 69 


Import..., 100-102 
Importung data 
From Excel files, 102 
From map files, 103 
From text files, 100 
Pasung from the clipboard, 99-100 
Inequalities, 70 
Information, 41-43 
Installing SYSTAT and SYGRAPH, 5 
Instant graphs and statistics, 174, 197-201 
Example, 198-201 ' 
Matrix cells, 198 
Variable name, 198 
INT function, 69 
Inverse functions, 73 
Integer truncation function, 69 
Inverse distribution functions, 73 
[Italic] (see Formats...) 


. ee a ean ny? 





Keyboard shortcuts, 128 


’ [Large screen] ourput, 117 
Lasso tool (see Scatterplot brushing) 
Left arrow tool (see Scatterplot brushing) 
Less than (<), 70 
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Less than or equal to Not equal to 


Less than or equal to (<=), 70 

LGM function, 69 

Linear, 167 

LOG function, 69 

Log gamma function (see LGM function) 
Logical expressions, 70-72 

Logical operators, 70-72 


Macintosh basics, 13-15 
Manipulating data files, 106-07 
Map files, reading, 103 
Marquee tool (see Scatterplot brushing) 
Math..., 65-66 
Matrix input, 82 
MAX function, 72 
Maximum file size, 50 
Menu, 13 
Menu bar, 13 
[Menu click] 
MIN function, 72 
MIS function, 72 
Missing values, 54-56 
Moving graphs, 148 
Movie..., 151-56 

Resolution, 151 

Time series example, 152~56 
Multi—variable functions, 72 
Multiplication (*), 69 


Naming data files, 94 

Naming variables, 51-53 
Natural logarithm function, 69 
Nested sort, 107 

New page (see By groups...) 
[Next plot], 151 

Nonexclusive options, 129 


‘Normal distribution, 73 


[Normal] output, 117 
[Nort], 71 


Not equal to (<>), 70 
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Notes... Pasung from the clipboard/Variable Hannes 


Notes..., 113 

NRAN (see ZRN) 
Numeric variables, 47 
Numerical values, 53 


Object oriented, 158 
Odometer, 66 
One integrated environment, 4 
Opening a SYSTAT data file, 18-19 
Opening the Data Editor, 48-50 
Operating ups, 194-202 
Operators, 68-76 
Options (Graph menu), 157 
Color..., 157 
Formats..., 157 
Global settings..., 157 
Optons, 129 
Exclusive, 129 
Nonexclusive, 129 
That take arguments, 129 i 
[Or], 70 : 
Order of expression evaluation, 76 
Organization of SYSTAT, 99 
Outliers, 170 
[Outline] (see Formats...) 
Output, 110-21 
Output device control, 110-21 
[Overlay plot], 150 





Page formats..., 117 

Page preview..., 116 

Page Setup..., 98 

Page utles, 117 

Paging, 14 

Paste, 63, 99-100 

Pasting data from the clipboard, 99-100 
Character values, 99 
Delimiters, 99 
Variable names, 99 
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PICT files 
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Reading Excel files 


PICT files, 112 
Viewing, 112 
[Plain] (see Formats...) 
Plain text formats, 100 
Plot tools and colors..., 158-63 
Circle, 160 | 
Line, 160 
Oval, 159 
Pointer, 161 
Moving objects, 161 
Removing objects, 161 
Resizing objects, 161 
Rectangle, 159 
Rounded-comer rectangle, 159 
Text, 160 
Positioning graphs, 142-49 
By mouse, 143 
By typing, 144 
Predictor variables 
Categorical, 166 
Continuous, 166 
Print data..., 98, 119 
Print graph..., 116-17 ° 
Print selected text..., 118 
Printing a SYSTAT data file, 98-99 
Printing output, 116-19 


Random deviates, 74 
Random distribution functions, 73 
Random number generator, 75 
Random subsample, 94 
Rank..., 108 
Ranking variables, 108 
Large files, 108 
Random variate functions, 73 
Reading a text file, 100-102 
Character values, 102 
Delimiters, 101 
Missing values, 102 
Variable names, 101 
Reading Excel files, 102-03 





Reading map files 


Save selected text as... 


Reading map files, 103 
Reading SYSTAT files, 99 
Recode..., 66-68 
Record preferences, 127 
[Rectangular] (see F ormats...) 
Redirecting output, 110-21 

Saving in files, 112 

Placing in clipboard, 120 

Prinung, 116 
Redo last analysis, 195 
Regression, 26-27, 166 
Relational operators, 70 
Remove select, 84 
Resize box, 148 
Resizing graphs, 144 

By mouse, 144 

By typing, 144 
Results to file as..., 114 
Results to printer, 117, 118 3 
Results to window, 114 s 
RSEED function, 75 : 
Round-off error 
Ruler tool, 142 

[Height], 142 

Marquee, 142 

Resize box, 148 

[Show graph], 143 

[Width], 142 

[X origin], 144 

[Y origin], 144 

(Z height], 144 
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Save..., 56, 94 
Save as..., 56, 95 
[SYSTAT], 95 
[Tabs], 95 
[Text], 95 
[Dlog], 95 
Save graph as..., 112 
Save selected text as..., 114 
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Saved data 
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Sorting variables 


Saved data, using, 115 
Saving data, 56-58, 94-97, 112-115 
Saving data in an textfile, 95, 104 
Delimiters, 105 
Saving data produced by analyses, 115 
Saving dialog boxes, 196 
Scatterplot, 22-23 
Scatterplot brushing, 182-83, 196 
Clicking on points, 183 
Lasso tool, 184 
Left arrow tool, 183 
Marquee tool, 184 
Selecting points, 183 
Tutonal, 187-93 
[Scroll analyses] (see Formats...) 
Scrolling, 14 
Select..., 82-84, 135 
Selections, turning off, 84 
Selecting cases 
Algebraically, 82-84, 186 
Scatterplot brushing, 182-86 
Selecting data, 84-85, (also see Scatterplot brushing) 
Blocks of data, 60 ; 
Columns, 61 
Rows, 61 
Selecting subgroups, 135-36 
[Shadow] (see Formats...) 
Shortcuts, 195-202 | 
Show graph (see Ruler tool) 
Show options in effect..., 126 
[Similarity] (see Formats...) 
Simple data transformations, 64-77 
SIN function, 69 
Single precision, 96 
Size box, 14 
Sizing graphics, 142-49 
Sort..., 106. 
Sorting data, 106-07 
Character order, 107 
Nested, 107 
Numeric order, 107 


Sorting variables, 106 
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Spreadsheer SYSTAT supplements/LOGIT 


Spreadsheet, 50 
Space limits of procedures, 7 
Special data structures, 86 
Spin, 171-81 
Background, 173 
Brushing tools, 174 
Degrees, 173 
Eye, 174 
Home, 172 
Instant graphs, 174 
Labels, 173 
X,Y,Z, 173 
Tutorial, 174 
Spinning, 5, 171-81 
SPLOMs, 166 
[Stack plot], 150 
Standardize..., 109 
Standard results, 125 
Statistics, 24-27 
Straightening scatterplots, 166-70 
Stats menu, 24 
Submit clipboard, 113 
SQR function, 69 
Square root function, 69 
[SSCP] (see Formats...) 
STD function, 72 
Straightening scatterplots, 166~70 
Subgroups 
Analyzing, 134-36 
Selecting, 135-36 
Subscripted variables 
Subset analysis, 185-86 
Subtraction (—), 69 
SUM function, 72 
Supplements (see SYSTAT supplements) | 
(SYSTAT] (see Save as...) Fi 
SYSTAT data files, 7, 91-109 
SYSTAT supplements, 8 
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2SLS, 8 4 
DESIGN, 8 E 
EzPATH, 8 e 
LOGIT, 8 E 
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SYSTAT supplements/PROBIT 
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PROBIT, 8 
SETCOR, 8 
SIGNAL, 8 
SURVIVAL, 9 
_ TESTAT, 8 
TOBIT, 8 


t distribution, 73 
Tab-delimited, 95, 99, 101, 104-05 
[Tabs] (see Save as...) 
[Text] (see Save as...) 
TAN function, 69 
Tangent function, 69 
TCF function, 73 
Tear-off menu, 19-21 
[Test variable] (see Find case...) 
Text editor, 195 
TIF function, 93 
Text file 

Reading, 100-02 

Saving, 104 
[Text font] (see Formats...) 
Text list, 31 
Text output, 111 
Text window, 33 
Thumb, 14 
TIF function, 73 
Time limits of procedures, 7 
Title bar, 14 
Title lines, 118 
Transforming data, 65-77 
Transpose..., 109 
Transposing a file, 109 
Tnangular matrices, 90-91 
Trigonomertric functions, 69 
TRN function, 73 
Turning off selections, 84 . 
Tutorial, 16-27 


‘UCF function, 73 
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UCF function 
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UIF function 
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UIF function, 73 

Unary minus function, 69 

[Underline] (see Formats...) 

Uniform deviates, 73 

Uniform distribution functions, 73 
Uniform random numbers, 73 

URN function , 73 

(Use all variables] (see Define bundles...) 
Using saved data, 115 


Values, 53-56 
Character, 54 
Missing, 54 
Numeric, 53 
Variable bundles, 130-33, 196 
[Variable or expression] (see Find case...) 
Variable names, 51-53 
Changing, 59 
Character, 51 
Entering, 51 
Subscripts, 51 
Variables, 46 
Character, 47 
Numeric, 47 
View window, 111 
Viruses, § 


Weight..., 137 
[Weights off], 138 
Weighting cases, 137-39 
Non-integer, 138 
Saving analyses after weighting, 138 
Windows, 13 : 
As text editors, 113, 195 r 
Worksheet, 46 


[X origin] (see Ruler tool) 
XCF function, 73 
XIF function, 73 
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XRN function 
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XRN function, 73 
(Y origin] (see Ruler tool) 


(Z height] (see Ruler tool) 
7.CF function, 73 

7IF function, 73 

Zoom box, 14 

ZRN function, 73 


ZRN function 
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